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1.  Executive  Summary 


This  report  describes  progress  and  results  on  the  grant,  "Photonic  Technology  for 
Implementation  of  Generalizable  Neural  Networks:  A  Synthetic  Approach", 
DARPA/ AFOSR  Grant  No.  AFOSR-69-0466,  for  the  annual  reporting  period  30  Sept 
91  -  29  Sept  92,  and  for  the  final  reporting  period  30  Sept  89  -  15  Sept  93.  Substantial 
progress  and  results  have  been  achieved  in  the  areas  of  neural  network  models  for 
optoelectronic  systems,  photonic  neural  network  architectures,  incoherent/coherent 
holographic  interconnections,  optical  disk  spatial  light  modulators  with  parallel 
readout  (as  input  devices),  and  high  bandwidth  spatial  light  modulators  (for  neuron 
unit  arrays)  including  mutually  compatible  silicon  detectors,  silicon  electronics,  and 
high  contrast  ratio  InGaAs/GaAs  modulators. 

Our  approach  has  been  vertically  integrated  in  that  it  synthesizes  materials, 
devices,  architectures,  and  algorithms.  The  effort  has  been  centered  around  a  novel 
class  of  photonic  architectures  that  provide  for  generalizability  to  different  neuron 
and  neural  network  models,  modularity,  fully  parallel  input/output,  and  scalability 
to  large  numbers  of  neuron  units  and  interconnections.  The  architectures 
incorporate  optoelectronic  spatial  light  modulators  for  flexible  neuron  unit 
functionality  by  means  of  integrated  silicon  electronics,  as  well  as  silicon  detectors 
and  GaAs-based  modulator  elements  for  highly  parallel  input  to,  and  output  from, 
each  neuron  unit  array.  Each  pixel  of  this  neuron  unit  array  includes  dual  channel 
inputs  and  outputs  to  enable  bipolar  functionality.  For  consistency  among  the 
algorithm/model  level,  the  architecture  level,  and  the  hardware  component  and 
materials  levels,  we  have  developed  a  multilayer  neural  network  model  and 
learning  algorithm  based  on  this  dual  channel  signal  representation.  Furthermore, 
we  have  maintained  compatibility  of  all  hardware  architectures,  devices,  and 
materials  with  this  model. 

One  of  the  key  results  of  this  effort  has  been  the  simultaneous  development 
of  mutually  compatible  high-contrast  GaAs-based  modulator  element  arrays  for 
neuron  unit  output,  silicon  detector  and  electronic  circuitry  arrays  for  neuron  unit 
input  and  signal  processing,  and  multilayer  neural  network  models  and  learning 
algorithms.  Both  device  arrays  have  been  fabricated  and  demonstrated  to  yield  high 
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contrast,  high  bandwidth,  high  density,  and  dua’  channel  capability  that  is 
compatible  with  the  dual  channel  neural  network  model  mentioned  above. 

A  two  dimensional  array  of  surface-emitting  semiconductor  laser  diodes  can 
provide  a  compact  implementation  of  the  requisite  sources;  for  laboratory  purposes 
we  have  utilized  other  novel,  programmable  techniques  for  generation  of  source 
arrays  at  visible  as  well  as  infrared  wavelengths.  In  addition,  since  scalability  places 
demands  not  only  on  the  interconnection  and  the  neuron  unit  arrays,  but  also  on 
the  input  device,  we  have  investigated  the  use  of  an  optical  disk  spatial  light 
modulator  that  provides  parallel  readout  of  a  large  number  of  stored  input  training 
patterns  at  a  high  frame  rate. 

Central  to  our  photonic  architecture  is  a  novel  volume  holographic 
interconnection  technique  that  is  based  on  an  array  of  individually  coherent  but 
mutually  incoherent  sources.  Results  from  our  experimental,  computational,  and 
analytical  investigations  of  the  properties  of  this  interconnection  system  have  been 
a  key  impetus  in  the  pursuit  of  this  overall  work,  and  have  shown  the 
interconnection  system  to  be  exceptionally  low  in  crosstalk,  despite  highly  parallel 
recording  and  readout  J  the  volume  holographic  element.  This  photonic 
interconnection  architecture  exhibits  another  unique  property:  once  trained,  the 
stored  weighted-interconnection  pattern  can  be  copied  in  a  single  exposure  step  to 
another  volume  holographic  medium,  thus  permitting  duplication  of  a  pretrained 
neural  network  in  a  manufacturing  environment. 
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2.  Summary  of  Approach,  Progress,  and  Results 


This  section  gives  an  overview  of  our  approach,  progress,  and  results. 
Detailed  descriptions  are  given  in  the  attached  papers  and  in  the  previous  annual 
reports,  including  the  papers  attached  therein.  A  list  of  accoii.plishments  is  given  in 
Section  3. 

2.1  Introduction 

A  general  neural  network  architecture  should  have  the  following  features: 
(1)  modularity,  i.e.,  be  in  the  form  of  a  cascadable  "module";  (2)  capability  for  lateral, 
feedforward,  and  feedback  interconnections;  (3)  analog  weighted  interconnections; 
(4)  bipolar  signals  and  weights;  (5)  scalability  to  large  numbers  of  neuron  units  with 
high  connectivity;  (6)  generalizability  to  different  network  models  and  learning 
algorithms,  as  well  as  capability  for  extension  to  possible  future  network  models. 

Recently,  optical  and  optoelectronic  implementations  of  neural  networks 
have  been  the  focus  of  intensive  research  and  development  (see  for  example.  Ref.  1). 
In  our  effort  we  have  developed  a  novel  implementation  technique  that  provides 
for  most  if  not  all  of  the  above  properties.  Our  photonic  architecture^  utilizes  a 
novel  incoherent/coherent  hologram  recording  and  reconstruction  technique  for 
highly  parallel  operation  with  reduced  crosstalk.  High  bandwidth  optoelectronic 
spatial  light  modulators  with  integrated  detectors,  modulators,  and  control 
electronics  at  each  pixel  are  being  fabricated  for  the  2-D  neuron  unit  arrays. 

2.2  Photonic  Architecture 

A  broad  class  of  learning  algorithms  can  be  represented  by  the  weight  update 
equation: 


Awij  =  a6iXj  -  pwij 
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in  which  wjj  is  the  interconnection  weight  fron\  j  to  i,  Awjj  =  Awij(k+1)  -  wij(k)  is  the 
weight  update,  a  is  the  learning  gain  constant,  Xj  is  the  signal  level  of  the  input 
(e.g.,  neuron  unit  of  the  previous  layer  in  a  multilayer  net),  and  P  is  the  decay 
constant.  Suitable  choices  of  5i  give  different  learning  algorithms,  such  as  Hebbian, 
Widrow-Hoff,  and  back  propagation. 

In  our  photonic  architecture  the  neural  interconnections  as  well  as  the  weight 
updates  that  train  the  interconnections  are  formed  within  a  photorefractive  crystal. 
The  fanout  interconnection  pattern  from  one  neuron  unit  is  implemented  as  a 
single  hologram,  of  which  are  multiplexed  into  a  single  volume  holographic 
medium.  The  last  term  in  the  above  equation  is  an  optional  decay  term  that  is 
included  primarily  to  model  intentional  or  unintentional  decay  of  holograms  in 
such  a  photorefractive  crystal.  (Other  physical  effects  such  as  nonlinearities  and 
saturation  in  the  medium  are  considered  entirely  unintentional  and  are  treated 
separately.) 

Two-dimensional  optoelectronic  Ga As-based  spatial  light  modulator  (SLM) 
arrays,  with  integrated  detectors,  modulators  and  control  electronics,  implement 
inner  product  neuron  units,  as  well  as  the  training  plane  array,  which  generates  the 
5i  terms.  This  technology  provides  for:  (1)  incorporation  of  bipolar  signals  via  dual¬ 
channel  inputs  and  outputs;  (2)  slight  variants  of  the  same  basic  SLM  structure  for 
all  SLM's  in  the  architecture  (for  neuron  units  and  5,  generation);  (3)  incorporation 
of  different  neuron  unit  functions,  including  linear,  soft  threshold,  and  hard 
clipping,  as  well  as  variable  gain;  and  (4)  potential  extendibility  to  future  neural 
network  models. 

The  photonic  architecture  for  the  case  of  Hebb’an  learning,  5i  =  y,,  where  yi  is 
the  output  of  neuron  unit  i,  is  shown  in  Fig.  1.  (For  other  learning  algorithms, 
SLM]  is  replaced  by  one  or  more  SLM's  with  appropriate  functions,  and  for 
supervised  learning  another  beam  representing  desired  outputs  is  incident  on  the 
SLM.)  Only  feedforward  connections  are  shown. 

The  neuron  unit  arrays  mentioned  above  are  incorporated  by  means  of  two 
spatial  light  modulators  (SLM's).  SLM2  represents  the  array  of  input  units  to  the 
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current  optical  module,  and  SLMi  represents  the  array  of  output  neuron  units  of  the 
current  module  (yielding  outputs  of  the  neural  net  or  providing  the  input  neuron 
units  of  the  next  module).  An  array  of  mutually  incoherent  coherent  sources  is 
used  to  illuminate  the  system;  these  sources  are  provided  by  a  mutually  incoherent 
laser  diode  array,  or  by  a  coherent  beam  passing  through  an  optical  system  that 
generates  a  set  of  beams,  each  temporally  modulated  or  frequency  shifted  sufficiently 
to  provide  mutual  incoherence  over  the  detection  time  constant. 

During  the  computation  phase,  the  shutter  is  closed  to  prevent  learning.  The 
array  of  sources  is  imaged  onto  SLM2  as  a  set  of  read  beams.  Fach  beam  is 
modulated  by  SLM2  to  provide  the  inputs.  Then  each  (reference)  beam  Xj  passes 
through  the  hologram  to  provide  the  interconnections,  i.e.,  the  weighted  fanout 
from  each  input  Xj.  The  hologram  output  is  sent  to  the  write  side  of  SLMi  (i.e.,  the 
virtual  image  of  SLMi  from  the  reconstruction  is  imaged  onto  SLMi  via  the  lens). 
Optical  fan-in  of  multiple  holographic  reconstructions  at  this  plane  provides  linear 
incoherent  summation  of  weighted  interconnections  at  each  neuron  unit  input. 
The  dove  prism  reflects  the  images  so  that  their  orientation  at  SLMi  is  consistent. 

In  the  learning  phase,  the  shutter  is  open.  Each  weight  update  term  is 
computed  optically  and  recorded  into  the  photorefractive  material.  Light  from  each 
source  is  approximately  collimated  and  used  as  the  read  beam  for  SLMi.  Thus,  for 
an  N  by  N  array  of  sources,  there  are  beams  reading  SLMi  simultaneously,  each 
at  a  different  angle,  as  shown  in  Fig.  2;  all  yi  terms  are  encoded  onto  each  of  these 
beams.  This  novel  angular  multiplexing  technique  eliminates  beam  degeneracy, 
thereby  reducing  crosstalk  during  readout  compared  with  more  conventional 
single-source  fully  coherent  recording  techniques^.  Each  of  these  beams  then 
interferes  only  with  its  corresponding  reference  beam,  xj,  from  the  same  source,  in 
the  photorefractive  material.  This  process  writes  the  set  of  desired  weight  update 
terms  axjyi. 

The  central  architectural  feature  of  incoherent! coherent  double  angularly 
multiplexed  hologram  recording  and  readout  has  been  designea  to  satisfy  a  number 
of  constraints:  minimization  of  inherent  recording-induced  crosstalk;  minimization 
of  inherent  readout-induced  (fan-in)  crosstalk;  incoherent  (additive)  summation  of 


7 


Figure  2.  Interconnection  system  of  the  photonic  architecture,  showing 
incoherent/coherent  double  angular  multiplexing  geometry.  The  dashed  beam 
corresponds  to  a  different  source  than  the  solid  beam.  The  two  beams  are  mutually 
incoherent  and  are  incident  on  the  upper  SLM  at  different  angles. 


interconnection  weighted  inputs;  optimization  of  the  overall  interconnection 
throughput  efficiency;  and  minimization  of  recording  nonidealities  introduced  by 
the  use  of  photorefractive  media  for  grating  formation.  The  reconstruction 
accuracy,  residual  crosstalk,  and  overall  throughput  efficiency  of  such  an  N^-N^ 
interconnection  in  the  case  of  a  linear  holographic  medium  has  been  evaluated  by 
numerical  simulation  using  the  optical  beam  propagation  method  (BPM)3.  Results 
of  a  comparison  between  our  proposed  architecture  and  other  candidate  methods 
such  as  single-source  fully  coherent  simultaneous  recording  show  that  our  proposed 
method  exhibits  dramatically  reduced  crosstalk  as  well  as  significantly  increased 
optical  throughput.  In  addition,  experiments  in  photorefractive  and  fixed 
holographic  materials  that  we  have  performed  to  date  support  the  viability  of  this 
approach  to  the  implementation  of  highly  multiplexed  photonic  neural  network 
interconnections. 

This  photonic  architecture  potentially  scales  to  10^®  interconnections  in  one 
cascadable  module,  incorporating  10^  neuron  units  running  at  (for  the  computing 
phases)  10-50  MHz,  with  fully  parallel  (10^  lines)  input/output.  Our  technology 
development  program  has  been  directed  toward  the  rapid  realization  of  these  goals, 
and  our  results  have  indicated  that  these  goals  are  feasible. 

2.3  Photonic  Components 

In  order  to  assure  system  functionality,  the  requisite  components 
incorporated  in  such  an  architecture  must  be  designed  from  a  synthetic  viewpoint, 
such  that  their  resulting  performance  parameters  and  characteristics  are  mutually 
compatible.  In  this  section,  the  required  features,  design  constraints,  and 
achievements  in  the  implementation  of  each  essential  component  in  the 
architecture  are  summarized. 

The  detection,  amplification,  neuron  unit  response,  and  modulation 
functions  required  in  both  the  neuron  unit  and  training  planes  are  incorporated  in 
multifunction  spatial  light  modulators  (SLM's).  A  dual  rail  differential  approach  is 
used  because  of  its  inherent  capability  to  accommodate  both  bipolar  inputs  and 
outputs.  The  simpler  case  of  unipolar  outputs  and  bipolar  inputs,  also  common  in 
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neural  network  models,  represents  a  subset  of  our  fully  bipolar  design  and  requires 
even  less  chip  area.  The  dual  rail  approach  requires  the  hybrid  or  monolithic 
integration  of  two  detectors,  appropriate  amplification  and  control  circuitry,  and  two 
modulators  within  each  SLM  pixel.  A  primary  goal  has  been  to  develop  analog 
electronic  circuitry  that  is  process  compatible  with  both  detector  and  modulator 
requirements,  and  at  the  same  time  utilizes  minimum  real  estate. 

Our  design  incorporates  silicon  technology  for  detection  and  analog  electronic 
neuron  unit  processing,  and  GaAs-based  strained-layer  multiple  quantum  well 
technology  for  modulation.  As  shown  in  Fig.  3,  by  flip-chip  bonding  two  such  chips 
face-to-face  and  operating  at  a  wavelength  for  which  the  GaAs  semi-insulating 
substrate  is  transparent,  electrical  contact  between  the  two  chips  is  provided  at  each 
pixel  without  either  through-substrate  vias  or  significant  losses  in  chip  area, 
bandwidth,  or  reliability.  The  design  specification  has  used  100  pm  X  100  pm  pixels, 
divided  into  separate  regions  for  the  detection,  modulation,  and  electronic  circuitry. 

Fabrication  of  6  X  6  arrays  in  silicon  have  shown  sigmoidal  response  with  full 
dual  rail  input  and  output  capability,  excellent  uniformity  over  the  array,  and 
bandwidths  of  each  pixel  on  the  order  of  10  MHz.  Fabrication  of  InGaAs/GaAs 
Fabry-Perot  cavity  multiple  quantum  well  reflective  modulators  in  2-D  arrays  have 
yielded  a  contrast  ratio  of  66:1  with  a  dynamic  range  of  30%  for  conventional  cavity 
geometries^,  and  a  contrast  ratio  of  30:1  with  a  dynamic  range  of  50%  over  a  2-D 
array  in  an  inverted  cavity  geometry.  This  inverted  cavity  geometry^  uses  an 
external  high-reflectivity  cavity  mirror,  so  that  the  modulator  can  be  read  out 
through  the  GaAs  substrate,  as  required  for  flip-chip  bonding  to  the  silicon  chip. 

The  two-dimensional  array  of  sources  must  incorporate  constant  amplitude, 
individually  coherent  but  mutually  incoherent  source  elements.  To  assure  mutual 
incoherence,  the  mutual  coherence  times  between  any  pair  of  individual  sources 
need  only  be  less  than  the  grating  recording  time  constant  of  the  holographic 
interconnection  and  the  time  constant  of  the  neural  detection  circuitry.  We  give 
two  examples  of  such  source  arrays.  Two-dimensional  arrays  of  surface  emitting 
laser  diodes  have  been  fabricated  by  researchers  at  AT&T  and  Bellcore^,  at  a  density 
of  2  X  10^  lasers  per  cm^.  Such  lasers  will  be  primarily  mutually  incoherent  by  the 
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Readout  Write  Antireflection 


Contact  Substrate 


Figure  3.  Cross  section  of  one  pixel  of  spatial  light  modulator  structure,  showing 
silicon  detector  and  silicon  analog  electronic  circuitry  flip-chip  bonded  to  GaAs- 
based  reflection  modulator.  Designed  wavelength  of  operation  is  approximately  950 
nm. 


very  nature  of  nonuniformities  inherent  in  the  fabrication  procest  For  laboratory 
demonstrations  in  the  visible,  we  have  fabricated  acousto-optic  source-array 
systems,  each  of  which  generates  a  set  of  beams  from  one  coherent  laser  source. 
Each  beam  of  the  set  of  beams  is  Doppler  shifted  by  a  different  amount,  yielding 
effective  mutual  incoherence.  This  acousto-optic  technique  has  been  used 
successfully  to  record  a  set  of  holograms  in  photorefractive  materials  using  the 
incoherent /coherent  double  angular  multiplexing  technique  described  above. 

In  conclusion,  this  research  and  development  program  encompasses  the 
interdisciplinary  areas  of  neural  models,  photonic  architectures,  photonic 
components,  and  photonic  materials  in  order  to  develop  a  consistent  technology  for 
implementation  of  large-scale  densely  interconnected  neural  networks.  Our  use  of 
an  array  of  mutually  incoherent  sources  provides  for  a  novel  incoherent/coherent 
double  angular  multiplexing  technique  for  low  crosstalk,  high  throughput 
interconnections;  hybrid  flip-chip  bonded  SLM’s  also  provide  for  high  speed  neuron 
unit  functionality  in  a  near-term  integrable  package.  We  have  also  developed 
neural  network  models  to  ensure  the  self-consistency  of  the  dual  rail  signal 
representation  for  neuron  units  and  the  corresponding  interconnection  functions. 
Our  future  plans  include  demonstration  of  the  hybrid  flip-chip  bonded  SLM  that 
implements  neuron  unit  arrays,  and  a  system-level  demonstration  of  neural 
network  operation  using  this  novel  photonic  architecture. 
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3.  List  of  Accomplishments 


3.1  Development  of  a  multilayer  learning  algorithm  that  exhibits  complete 
bipolar  functionality  but  can  be  implemented  with  only  nonnegative  signals. 
This  approach  permits  implementation  of  photonic  neural  networks  that  represent 
signals  as  (nonnegative)  intensity  levels  and  weights  as  (nonnegative)  diffraction 
efficiencies,  and  that  utilize  dual  channel  inputs  and  outputs  for  each  neuron  unit. 
In  our  simulations  (on  2-bit  and  3-bit  parity,  and  invariant  TC  recognition  problems) 
this  back-propagation  style  learning  algorithm  has  performed  comparably  to,  or 
better  than,  conventional  back- propagation  learning. 

3.2  Development,  refinement,  and  generalization  of  the  photonic  neural  network 
architecture.  Design  of  the  generic  architecture  has  been  completed,  and  variants  of 
the  architecture  for  different  neural  network  models,  including  multilayer  back 
propagation,  have  been  designed.  The  architecture  is  modular,  permits  fully 
parallel  input/output,  has  a  unique  interconnection  that  yields  crosstalk-free 
performance,  and  permits  rapid  copying  of  the  interconnection  pattern. 

3.3  Verification  and  characterization  of  incoherent/coherent  holographic  (fan- 
in/fan>out)  intercoimection  systems  via  experiment  and  simulation.  Simulation 
at  a  variety  of  interconnection  sizes,  including  10  inputs  to  10  outputs,  have  shown 
our  novel  (incoherent/coherent,  double  angularly  multiplexed)  he-lographic 
interconnection  to  exhibit  high  fidelity,  relatively  high  diffraction  efficiency,  and 
extremely  low  crosstalk.  In  addition,  experiments  in  photorefractive  bismuth 
silicon  oxide  (2  inputs  to  4096  outputs)  and  photorefractive  lithium  niobate  (15 
inputs  to  16,384  outputs)  yield  further  evidence  that  the  underlying  concepts  are 
sound.  This  interconnection  is  a  key  element  in  our  photonic  neural  network 
architecture. 

3.4  Design  and  fabrication  of  2-D  arrays  of  detectors  and  analog  circuitry  in 
silicon  for  neuron  unit  input  and  processing.  Six  by  six  arrays  of  dual  channel, 
sigmoidal  function  neuron  units  have  been  demonstrated  using  analog  silicon 
CMOS  (15  transistors  within  2500  square  microns  per  neuron  unit),  and  exhibit 
bandwidths  of  14  MHz  (small  signal)  and  4  MHz  (large  signal).  Measured  output 
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voltages  of  12  volts  are  sufficient  for  driving  the  Ga As-based  moc'ulators.  Dual 
channel  neuron  unit  circuitry  incorporating  analog  sample  and  hold  memory  has 
also  been  desi^^ned  and  fabricated.  For  the  neuron  unit  inputs,  a  variety  of  silicon 
photodetectors  have  been  fabricated  and  tested,  and  used  in  cascade  with  the  analog 
electronic  sigmoidal  function. 

3.5  Design  and  fabrication  of  2-D  arrays  of  optical  modulators  in  GaAs/lnGaAs 
for  neuron  unit  output.  Multiple  quantum  well  optical  modulator  arrays  have 
been  demonstrated  in  two  configurations:  an  asymmetric  cavity  Fabry-Perot 
(exhibiting  66:1  contrast  ratio  and  30%  dynamic  range),  and  an  inverted  asymmetric 
cavity  Fabry-Perot  (exhibiting  30:1  contrast  ratio  and  50%  dynamic  range).  Drive 
voltages,  at  12  volts,  are  compatible  with  the  silicon  chip  electronics.  The  950  nm 
wavelength  operating  regime  of  these  high  bandwidth  modulators  provides 
compatibility  with  surface  emitting  semiconductor  laser  diode  arrays,  and  due  to 
substrate  transparency  permits  direct  flip-chip  bump-contact  bonding  with  the 
silicon  chip. 

3.6  Generation  of  an  array  of  self  coherent  but  mutually  incoherent  sources.  An 
acousto-optic  deflector  based  system  has  been  designed,  built,  and  demonstrated,  that 
generates  a  linear  array  of  15  sources  from  a  single  laser  source.  A  second  system 
that  can  generate  a  2-D  array  of  up  to  128  X  128  souues  has  been  designed.  In  the  1  X 
15  system,  self  coherence  and  mutual  incoherence  have  been  demonstrated  by  the 
simultaneous  writing  and  individual  readout  of  15  independent  gratings  in 
photorefractive  lithium  niobate.  This  source  array  generator  can  be  used  with 
visible  as  well  as  infrared  laser  sources,  and  is  useful  for  providing  the  mutually 
incoherent  beams  required  in  the  photonic  neural  network  architecture. 

3.7  Demonstration  of  recording  and  high-speed  parallel  readout  of  2-D  images  on 
an  optical  disk,  with  potential  applications  to  presentation  of  training  patterns 
to  a  neural  network  during  learning.  Recording  and  fully  parallel  readout  of  gray- 
level  512  X  512  images  stored  on  an  optical  disk  has  been  demonstrated.  A  novel 
differential  interference  contrast  readout  technique  permits  direct,  optically  efficient 
readout  yielding  high  (50:1)  contrast  ratio  images.  This  "optical  disk  spatial  light 
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modulator"  can  be  used,  for  example,  to  provide  a  high  frame-rate  input  to  a 
photonic  neural  network. 
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Summary 

Volume  holography  has  often  been  proposed  as  an  interconnection  technology  for  photonic 
neural  network  implementations  [  1]  because  it  appears  to  potentially  meet  the  critical  requirements 
of  providing  both  large  numbers  of  interconnections  and  a  weight^  fan-out/fan-in  topology.  The 
feasibility  of  using  volume  holograms  for  large-scale  weighted  fan-out/fan-in  interconnection 
applications  depends  in  part  on  the  fidelity  with  which  the  interconnection  weights  can  be 
implemented,  and  on  the  throughput  that  can  be  achieved  in  the  volume  holographic 
interconnection  system.  In  addition,  an  important  implementation  issue  is  the  number  of  exposures 
required  to  record  a  particular  interconnection  configuration  because  this  can  impact  the  total 
writing  time  and  required  exposure  schedule. 

In  many  neural  network  algorithms,  the  interconnections  between  any  two  layers  can  be 
represented  as  p  =  Wx  in  which  each  component  of  the  vector  x  represents  the  output  of  a  neuron 
unit  in  the  input  layer,  W  is  the  interconnection  matrix,  and  each  component  of  the  vector  p  is  the 
input  to  a  neuron  unit  in  the  output  layer.  A  typically  proposed  volume  holographic 
interconnection  architecture  that  performs  this  interconnection  function  [1]  (hereafter  referred  to  as 
a  "conventional"  architecture)  is  shown  in  Fig.  1.  Each  interconnection  between  a  pixel  in  the 
input  plane  and  a  pixel  in  the  output  plane  (not  shown)  is  implemented  as  a  single  diffraction 
grating  in  the  holographic  recording  medium.  The  weight  of  the  interconnection  is  related  to  the 
relative  diffraction  efficiency  of  its  associated  grating. 

For  adaptive  neural  networks  in  which  a  learning  algorithm  is  implemented,  the  weight 
matrix  is  typically  formed  by  successively  presenting  training  patterns  (consisting  of  an  input 
image  at  the  input  plane  and  its  associated  training  image  at  the  training  plane)  to  the  system  until 
the  appropriate  error  criterion  is  minimized.  In  a  conventional  architecture,  there  are  three  methods 
to  record  the  desired  interconnection  gratings  during  the  presentation  of  a  single  training  pattern  to 
the  system.  In  the  first  method,  all  of  the  pixels  in  the  input  and  training  planes  are  turned  on 
simultaneously.  Since  the  light  from  each  pixel  is  mutually  coherent,  the  desired  interconnection 
gratings  are  formed.  In  addition,  undesired  gratings  are  created  that  form  intraplanar  connections 
amongst  the  pixels  of  both  the  input  plane  and  the  training  plane.  As  is  generally  recognized,  these 
additional  coupling  paths  introduce  a  serious  source  of  crosstalk  into  the  interconnection  system. 
By  recording  the  desired  gratings  in  a  page-wise  sequential  (i.e.,  a  single  pixel  in  the  input  plane 
simultaneously  with  all  of  the  training  plane  pixels)  or  fully  sequential  (i.e.,  a  single  input  pixel 
with  a  single  training  pixel)  manner,  some  or  all  of  these  coherent  crosstalk  gratings  may  be 
avoided.  The  cost,  however,  is  to  require  additional  exposures  compared  to  the  simultaneous 
recording  method.  For  an  interconnection  system  connecting  N  input  nodes  to  N  output  nodes  (an 
"N-N"  interconnection),  the  simultaneous  recording  method  requires  one  exposure  per  training 
pattern  whereas  the  page-wise  sequential  and  fully  sequential  methods  require  N  and 
exposures,  respectively.  For  large  numbers  of  interconnections  (the  situation  for  which 
holographic  interconnections  are  presumably  attractive),  this  could  result  in  impractically  long 
training  sessions. 
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We  quantitatively  examined  the  reconstruction  fidelity  and  throughput  of  the  conventional 
architecture  by  numerically  modeling  a  10-10  interconnection  system  using  the  optical  beam 
propagation  method.  A  linear  holographic  recording  medium  was  assumed  with  unlimited 
modulation  range.  Ten  training  patterns  of  random  intensities  were  used  to  generate  the  (nearly 
independent)  weight  matrix  recorded  in  the  holographic  medium.  For  this  choice  of  weights,  the 
RMS  error  of  the  diffracted  outputs  [2]  for  a  conventional  architecture  is  shown  in  Fig.  2a  for 
simultaneous  (with  R=l,  in  which  R  is  the  beamsplitter  ratio  in  Fig.  1),  page- wise  sequential 
(R=100),  and  fully  sequential  recording  (R=l).  The  horizontal  axis  represents  the  grating  strengfh 
(in  radians)  of  the  largest  interconnection  grating  recorded  in  the  medium.  The  interconnection  is 
readout  with  a  set  of  mutually  coherent  random  intensity  beams.  The  optical  throughput  is  shown 
in  Fig.  2b.  As  expected,  the  RMS  error  of  the  reconstructed  outputs  is  large  for  simultaneous 
recording.  There  is  little  improvement  in  the  error  in  going  to  page-wise  sequendal  recording  using 
a  beamsplitter  ratio  of  100.  This  can  be  understood  as  follows.  During  the  recording  of  a  single 
training  pattern,  the  crosstalk  gratings  connecting  the  pixels  within  the  training  plane  are  exposed  N 
times.  The  desired  gratings,  however,  each  receive  only  a  single  exposure.  When  the  undesired 
crosstalk  gratings  are  eliminated  using  fully  sequential  recording,  the  RMS  error  drops  dramatically 
such  that  it  is  still  relatively  low  at  a  peak  throughput  of  over  85%. 

An  alternative  volume  holographic  interconnection  architecture  has  recently  been  proposed 
[3]  that  requires  only  one  exposure  per  training  pattern  and  yet  minimizes  undesired  crosstalk 
gratings.  This  architecture  includes  an  optical  source  array  in  which  each  source  is  individually 
coherent,  but  mutually  incoherent  with  all  other  sources.  In  Fig.  3  we  show  a  modification  of  this 
architecture  that  for  implementation  in  certain  photorefiractive  materials  does  not  suffer  from  an 
inherent  throughput  loss  due  to  multiple  beam  superposition.  Using  lenses  Li  and  L2  (see  Fig.  3) 
a  set  of  subholograms  (which  may  partially  overlap)  are  recorded  in  the  holographic  medium. 
Each  subhologram  consists  of  a  1-N  fan-out  connecting  a  single  input  pixel  to  every  output  pixel 
(with  the  aid  of  an  imaging  lens  following  the  holographic  medium  [not  shown]). 

Numerical  modeling  results  for  this  multiple  source  incoherent/coherent  architecture  (for 
R  =  100)  are  shown  in  Fig.  4  (in  which  non-overlapping  subholograms  are  assumed).  The 
weight  matrix  and  random  inputs  are  the  same  as  used  above  for  the  conventional  architecture.  The 
RMS  error  of  the  diffracted  outputs  is  comparable  to  the  results  obtained  for  fully  sequential 
recording  in  a  conventional  architecture,  and  yet  the  multiple  source  incoherent/coherent 
architecture  requires  only  a  single  recording  per  training  pair. 

In  addition  to  requiring  fewer  exposures  to  achieve  the  same  level  of  fidelity  and  throughput 
as  obtained  for  fully  sequential  recording  in  a  conventional  architecture,  the  multiple  source 
incoherent/coherent  architecture  offers  several  further  advantages.  First,  the  weighted 
interconnections  stored  in  the  volume  holographic  medium  can  be  copied  to  another  volume 
holographic  medium  in  a  one-step  process  (which  is  useful,  for  example,  for  transferring  a  set  of 
learned  weights  from  a  dynamic  to  a  permanent  holographic  medium)  [4],  Second,  since  the 
source  array  shown  in  Fig.  3  consists  of  optical  sources  that  are  individually  coherent  but  mutually 
incoherent,  the  summation  at  the  output  nodes  is  performed  by  the  fan-in  of  mutually  incoherent 
beams.  This  eliminates  the  requirement  for  strict  positional  stability  of  the  optical  components 
during  readout  (which  is  required  in  a  conventional  architecture  for  readout  with  mutually  coherent 
beams).  No  degradation  in  throughput  (see  Fig.  4b)  is  suffered  with  the  use  of  mutually 
incoherent  beams  because  an  angular  fan-in  is  used  at  each  output  node. 
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PRE-PATTERNED  GaAs(lOO)  SUBSTRATES  VIA  MOLECULAR  BEAM  EPITAXY: 
APPLICATIONS  TO  LIGHT  MODULATORS  AND  DETECTORS 
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ABSTR.ACT 

We  repon  the  realization  of  high  quality  strained  InCaAs/GaAs  multiple  quantum 
wells  (MQW)  grown  on  planar  GaAs  (100)  substrates  through  optimization  of  molecular 
beam  epitaxical  (MBE)  growth  conditions  and  structure.  Such  MQWs  containing  ~1 1% 

In  have  lead  to  the  realization  of  an  asymmetric  Fabry-Perot  (ASFP)  reflection  modulator 
with  a  room  temperature  contrast  ratio  of  66:1  and  an  on-state  reflectivity  of  30%.  For  In 
composition  2  0.2.  the  improved  optical  quality  for  very  thick  (>2uni)  InGaAs/GaAs 
MQWs  grown  on  pre-pattemed  substrates  is  demonstrated  via  transmission  electron 
microscopy  (TE.\I)  and  micro-absorption  measurements. 


Strained  InGaAs/.AlGaAs  layered  structures  grown  on  GaAs(lOO)  substrates  have 
attracted  considerable  attention  for  their  application  in  vertical  cavity  surface  emitting  lasers  [1], 
resonant  tunneling  diodes  (RTD)  (2],  and  light  modulators  (3,4).  In  the  transmission  geometry, 
such  modulators  can  take  advantage  of  the  transparent  nature  of  the  GaAs  substrate  so  that, 
unlike  the  GaAs^AlGaAs  based  modulators,  there  is  no  need  for  a  patterned  removal  of  the  GaAs 
substrate.  Reflection  mode  .ASFP  modulators,  when  combined  '.'ith  RTDs  and  Si  field  effect 
transistors,  provide  optical  digital  switches  with  high  fan-out  useful  for  digital  optical 
computing  and  communication  networks  (5).  In  the  reflection  geometry,  inverted  reflection 
modulators  (6)  can  also  be  made  in  which  light  passes  i::  and  out  through  the  substrate  so  that  a 
Si-chip  containing  control  electronics  can  be  flip-chip  bonded  to  the  modulator  bearing  G«iAs 
chip  (7).  When  combined  with  arrays  of  strained  InGaAs/GaAs  lasers,  detectors  and  Si-control 
electronics,  analog  optical  neuron  units  can  be  realized. 

Unlike  strained  InGaAs/AlGa.As  quantum  well  lasers  and  RTDs,  which  require  relatively 
thin  active  layers,  the  major  requirement  for  light  modulators  is  to  grow  the  strained  active 

region  to  thicknesses  (-1.5um)  necessary  to  obtain  reasonable  optical  interaction  path  length. 

Through  control  of  the  growth  kinetics  via  fcCection  high-energy  electron  diffraction  (RHEED) 
pattern  and  intensity  dynamics  (S)  in  mclecuiar  beam  epitaxical  (MBE)  growth,  we  have 
realized  sharp  e.xciton  linewidths  it.  such  multiple  quantum  wells  and  thus  achieved  a  typical 

absorption  modulation  per  well,  (Aa  d^,)  of  ~  0.004  in  the  exciton  tail  region  at  relatively  low 
bias  (-  80  kV/cm)  (4).  Placing  these  strained  MQWs  in  an  ASFP  configuration  involving 

integrated  Bragg  mirrors  it  should  thus  be  possible  to  obtain  high  contrast  ratios.  Contrast  ratios 
of  >  100:1  have  been  demonstrated  [9,10]  in  the  AlGaAs/GaAs  system  but  have  been  limited  to 
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•'3:1  in  the  InGaAs/GaAs  system  [11].  The  high  contiast  latio  in  the  ASFP  configuration  is 

obtained  by  modulating,  say  elearically,  the  absorption  in  the  MQW  so  that  at  one  cf  the 
voltages  the  primary  reflected  beam  is  canceled  by  the  higher  order  reflected  beams  and  a  near 
zero  reflectivity  is  achieved.  Calculations  indicate  that  with  front  and  back  mirror  reflectivities 

of  0.68  and  0.99  respectively,  it  is  possible  to  reach  this  low  reflection  state  for  Aa-d^'N  (where 

N*numbcr  of  wells)  of  ~  0.15  when  operating  near  9500A  (In  content  -  12%).  For  higher  In 
composition  (>  20%>),  to  circumvent  the  problem  of  dislocation  generation  at  the  needed 
thtcimess,  we  have  applied  the  physical  idea  of  defect  reduction  through  growth  on  pre-pattemed 
substrates  [12,13].  Here  we  show  that  even  when  the  growth  on  the  planar  part  of  the 
substrate  exhibits  high  density  of  stmctural  defects  and  no  excitonic  features,  the  MQWs  grown 
on  the  pre-pattemed  part  (both  on  top  of  the  mesa  and,  surprisingly,  in  between  mesas)  still 
show  nearly  perfect  layering  and  sharp  excitonic  features  with  linewidths  comparable  to  those 
obtained  on  samples  of  tower  In  composition.  Using  this  defect  reduction  technique,  it  is  thus 
possible  to  grow  such  highly  strained  MQWs  with  integrated  Bragg  mirrors  to  realize  high 
performance  ASFP  modulators  operating  near  9800A  wavelength  made  important  by  the 
developments  in  Er  doped  optical  fibers  and  at  1.06pm  wavelength  of  interest  due  to  the 
availability  of  Nd;YAG  lasers. 


The  structures  were  grown  in  our  REBER  32P  MBE  system  on  cither  Cr-doped  semi- 
insulating  or  Si-doped  n*  GziAs  (100)  substrates.  The  temporal  behavior  of  the  RHEED  pattern 
and  the  specular  beam  intensity  was  employed  to  obtain  optimized  growth  conditions,  ensure 
reproducible  growth  conditions,  and  to  monitor  accurately  the  group  m  incorporation  rates. 


Fig.l  shows  the  electromodulation 
behavior  of  samples  RG900420  and 
RG900726  which  ^consist  of  50  period 
Ioq  1 1 GaQ  As(  1 00.A.)/GaAs(200A)  and  20 
period  Ino.i9GaQ  3jAs(80A)/GaAs(  145A) 
MQWs,  respectively,  grown  on  non-pattemed 
Ga.As(100)  substrates.  For  comparison,  data 
for  InGaAs/GaAs  MQWs  appearing  in  the 
literature  [14-16]  are  also  included.  The 
product  of  the  maximum  change  in  absorption 

coefficient  (Aa)  and  the  well  width  (d^), 
which  gives  moduls'ion  per  well,  is  plotted  for 
Aa  in  the  exciton  peak  and  tail  regions  as  a 
function  of  the  electric  field  with  the  built-in 
field  accounted  for.  The  room  temperature 
exciton  half  widths  at  half  maximum 
(HWHM)  for  RG900420  and  RG900726  are 
5.5meV  and  7.5meV,  respectively.  Sample 

RG900420  clearly  shows  very  large  Aa  d^  of 
~  0.004  in  the  e.xciton  tail  region  at  a  low 

electric  field  of  ~  60  kV/cm.  Not  shown  in  the 

figure,  a  sample  RG900802  (comprised  of  50 
period  Ioq  i3Gao.i7.A.s(  100A)/Ga.-\s(125A)  ) 
gives  a  room  temperature  HWHM  of  6.2meV 
and  per  well  modulation  of  -0.004  at  a  field 
of  -  90kV/cm. 
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Fig.  I  Comparison  of  the  product  of  maximum 

change  in  the  absorption  (Aa)  and  well  width 
(d  J  for  use  sample  RG900420  and  RG900726 
with  others  in  the  literature. 
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Given  these  MQW  characteristics,  .ASFP  reflection  modulators  were  grown  in  which  such 
MQWs  are  sandwich  between  high  quality  Al^Ga|..,jAs/ALGa|.yAs  Bragg  mirrors.  Our  sample 
RG9I0II5  consists  of  17.5  periods  of  AIAs(784A)/Ga.4s(654A)  as  the  bottom  mirror  with 
computed  reflectivity  Ru  =  99.4%,  3  periods  for  the  top  mirror  of  computed  =  68%,  and  in 


between  a  50  period  Ino.nG«0.89As(*OOAVCaAs(125A)  which  is  expected  to  have  similar 
behavior  as  samples  RG960420  ^  RG900802  discussed  above.  The  bottom  and  top  minors  are 
n^  and  p^  doped  except  for  the  last  and  the  first  period,  respectively.  The  grown  samples  were 

patterned  into  280pm  diameter  test  pixels  and  125pm  diameter  Cr-Au  contact  pads  were 
deposited  on  top  of  the  mesas  to  provide  top  electrical  contact  The  back  contact  is  provided  by 
the  In  used  for  mounting  the  GaAs  substrate  on  the  Mo  block  of  the  MBE  system.  The  reflection 
spectra  were  taken  in  a  normal  incidence  geometry  with  an  automated  Ti/Sapphire  laser  pumped 
by  an  Ar*  laser,  a  beam  splitter  cube,  and  two  Si  detectors.  Fresh  deposited  Au  was  used  for  the 
reflectivity  calibration. 


Fig.  2  (a)  shows  the  reflectivity  of  pixel  (4, 
31)  of  this  sample  at  OV,  8V  and  16V  reverse  bias. 
Note  that  by  about  -  8V  applied  bias  the 
reflectivity  at  the  zero  bias  F-P  wavelength  has 

dropped  from  ~19.2%  to  ~1.4%  with  a  slight  red 

shift  in  the  FP  position  itself.  The  modulation 
effect  is  thus  predominantly  an  electroabsorptive 
one  in  this  regime.  With  further  increase  in  the 
reverse  bias,  the  reflectivity  has  dropped  to  ~ 

0.46%  at  the  blue-shifted  F-P  wavelength  of 
938  lA  at  -16V.  The  blue  shift  in  the  F-P 
wavelength  is  a  consequence  of  electro-refraction 
setting  in.  The  reflectivity  contrast  ratio  (CR)  and 

change  (AR)  between  zero  and  -16V  applied  bias 
are  shown  as  a  function  of  the  wavelength  in  Fig. 
2,  panels  (b)  and  (c),  respectively.  A  maximum 
contrast  ratio  of  66:1  with  an  attendant  AR  of 
-30%  is  seen  at  9381 A  corresponding  to  the 

modulator  operation  in  the  *normally-on’ 
condition.  The  details  are  reported  in  Ref.  17. 


Wavelength  (A) 

Fig. 3  Quantum  efficiency  (Q.  E)  at  OV,  -8V,  and 
-16V  biases  and  reflectivity  (R)  at  -16V  bias  of 
pi.xel  (4,31)  as  a  function  of  wavelength.  The  thick 
curve  on  the  top  is  the  sum  of  Q.E.  and  R  at  -16V 
bias. 
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Fig.  2  Normalized  reflectivity  as  a  function 
of  wavelength  at  different  applied  biases  for 
pixel  (4,31).  The  reflectivity  change  and 
contrast  ratio  between  zero  and  -16V  bias 
are  shown  in  panels  (b)  and  (c),  respectively. 

Since  the  ASFP  is  in  a  p-i-n 
configuration,  such  a  modulator  can  also  be 
used  as  a  detector.  Fig.  3  shows  the 
quantum  efficiency  of  the  same  pixel  at  OV, 
-8V  and  -16V.  The  near  zero  quantum 
efficiency  at  zero  bias  is  indicative  of  the 
nearly  flat-band  condition  operative.  This 
was  a  design  goal  achieved  by  no  doping  of 
the  last  period  of  the  bottom  n'*'  mirror  and 
the  first  period  of  the  top  p’'  mirror.  At  -8V 
the  quantum  efficiency  at  Xpp  is  near  75% 


and  by  -16\'  it  has  reached  near  82'^a.  The  reflectivity  spectrum  at  -16V  is  reproduced  here  to 

show  that,  as  expected,  the  high  quantum  efTiciency  at  Xpp  is  a  consequence  of  the  increased 
absorption  due  to  multiple  reflection  under  the  FP  resonance  condition.  Without  the  FP  cavity, 

the  absorpti  'n  depth,  ad,  at  the  FP  wavelength  is  estimated  to  be  ~  0.15  and  thus  will  give 


ma.xiinum  quantum  efficiency  of  ~  14%  only.  The  sum  of  the  reflectivity  and  quantum 
efficiency  at  -16V',  shown  in  Fig.  3.  is  seen  to  be  near  unity  away  from  the  F-P  resonance  region 
but  dips  to  near  80%  at  Xpp.  The  loss  is  due  to  enhanced  ‘leakage*  of  the  light  through  the  bottom 
high  reflectivity  mirror  due  to  the  multiple  in-phase  transmissions  at  FP  resonance.  This  leakage 
is  e.xtremely  sensitive  to  even  small  variations  in  the  reflectivity  of  the  high  reflectivity  mirror. 

From  the  nearly  20%  leakage  we  estimate  the  operative  reflectivity  at  the  -16V  Xpp  to  be  97.8%. 
This  is  to  be  compared  vvith  the  99.4%  reflectivity  design  value  computed  at  zero  bias  on  the 
basis  of  the  GaAs^.Al.As  quaterwave  stack. 
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To  improve  the  optical  quality  of  the 
InGaAs/GaAs  MQW  with  higher  In  content  and 
large  thickness,  structures  were  grown  on  the 
GaAs  (100)  substrates  partially  patterned  with 
mesas  of  various  shapes  and  sizes.  The  substrate 
is  first  chemically  cleaned  using  standard  MBE 
cleaning  procedures  and  then  patterned  using 
photolithography  and  wet  .chemical  etching  as 
reported  in  our  earlier  work  [13].  An  example  is 
sample  RG891 1 10  which  consists  of  a  100  period 

^*^0.20^^0  3o^s(^®'^VGaAs(  1 60A)  MQW  of  ~ 

2.38  pm  total  thickness.  In  the  patterned  part,  the 

mesa  size  is  ~  16  pm  x  18  pm  and  the  pitch  is 

40pm  in  both  directions.  Fig.4(a)  shows  a  cross 
sectional  transmission  electron  microscope 
(XTEM)  image  contrast  taken  with  the  electron 

beam  along  the  [110]  azimuth  for  a  specimen 
prepared  from  the  non-patterned  part  of  the 
sample.  .A  large  number  of  structural  defects, 
including  threading  dislocations,  are  seen  to  be 
present  throughout  the  MQW  structure.  Figs.  4(b) 
and  4(c)  show  the  corresponding  image  contrast 
taken  (at  higher  magnification)  in  the  patterned 
part  between  mesas  and  on  top  of  the  .nesa, 
respectively.  Remarkably  perfect  InGaAs  (dark) 
and  Ga.As  (light)  layers  are  seen  with  hardly  any 
evidence  of  structural  defects  at  the  level  of  the 
TEM  resolution.  However,  towards  the  edges  of 
the  mesas,  the  layering  was  found  not  to  be  good 

within  2  to  3  pm  of  the  edges,  even  though  no 
defects  were  visible  using  the  customarily  chosen 
[1 10]  azimuth.  The  presence  of  contiguous  mesa 
top  plane  and  the  sidewall  planes  (typically  (111) 
with  some  high  index  facets  such  as  {311}/{4ll}, 
{11,1,1}  etc.)  gives  rise  to  interfacet  cation 
migration  whose  degree  and  nature  is  sensitive  to 
the  growth  conditions,  as  we  have  previously 
demonstrated  [13].  We  therefore  suspect  that 
interfacet  migration  is  in  part,  if  not  largely, 
responsible  for  the  degradation  of  the  sharpness  of 
layering  near  the  edges  of  the  mesas. 


Fig.5  Optical  transmission  behavior  of  sample 
RG891110  for  the  MQW  in  the  nonpattemed 
region  (a)  and  patterned  part  between  mesas  (b) 
and  on  top  of  the  mesa  (c). 


Rg.  S,  panels  (a),  (b)  and  (c)  show  the 
corresponding  room  temperature  transmission 
characteristics  of  this  sample.  The 
transmission  spectra  were  obtained  using  a 
hiidtly  collimated  beam  sampled  from  a 
tungsten  halogen  lamp,  a  lOOx  microscope 

objective  lerts  which  results  in  2  to  3pm  beam 
size  on  the  sample.  A  SPEX  1704 
monochromator  and  a  detector 

was  used  for  detection.  The  spectra  are 
normalized  to  the  system  response.  The  non¬ 
pattemed  region  shows  no  excitonic  features, 
consistent  with  the  TEM  findings  (Fig.  4a). 
By  contrast,  the  MQW  growth  on  top  of  the 
mesas  and  in  between  the  mesas  shows  not 
only  the  usual  heavy  hole  (hh)  to  first  confined 

electron  (le)  excitonic  transitions  at  ~lpm 

wavelength,  but  even  the  light  hole  (Ih)  to  Ic 
and  second  hh  to  second  electron  excitonic 
features  are  resolved,  indicating  the  high 
quality  of  the  MQW  in  these  regions.  This 
also  is  consistent  with  the  TEM  findings  (figs. 
4(c)  and  4(b).  The  HWHM  of  the  exciton  for 
the  .MQW  in  the  region  between  the  mesas  and 
on  top  of  the  mesa  are  6.2meV  and  I4meV, 
respectively.  While  a  clear  explanation  of  the 
sharper  e.xciionic  feature  in  the  trench  region 
requires  further  examination,  it  may  be 
speculated  that  it  is  partially  due  to  the 
interfacet  migration  since  it  seems  that  the 
high  index  planes  move  towards  the  center  of 
the  mesa  more  than  they  move  outwards.  The 
trench  regions  thus  provide  regions  for 
realization  of  good  light  modulator  pixels. 
The  linewidth  approaching  the  value  obtained 
for  lower  In  composition  MQWs  grown  on 
non-patterned  substrates  and  which,  in  the 
ASFP  configuration,  give  very  high  contrast 
ratio  and  reasonable  throughput  indicates  that 
such  MQWs  can  offer  good  electromodulation 
behavior  at  wavelength  near  1.06um. 


In  conclusion,  we  have  demonstrated  that  high  quality  strained  InCaAs/GaAs  (xsO.li) 

MQWs  of  large  thicknesses  (52pm)  and  with  good  electroabsorption  characteristics  can  be 
grown  individually  and  with  integrated  Bragg  mirrors  by  optimizing  the  MBE  growth  condition. 
The  resulting  material  has  lead  to  the  successful  demonstration  of  InGaAs/Ga.As  strained  ASFP 
modulators  with  room  temperature  contrast  ratios  as  high  as  66:1  while  maintaining  a  high 
throughput  of  30%.  These  modulators  provide  some  technological  advantages  over  the 
AlGaA^GaAs  modulators  and  can  be  integrated  with  other  III-V  devices  as  well  as  Si 
electronics  to  realize  optical  neuron  units  (6]  and  digital  photonic  switches  [5].  The  behavior  of 
the  latter,  utilizing  ASFP  modulators  and  detectors  realized  as  pan  of  this  work,  is  presented  in 
the  paper  by  Kapre  et.  al.  in  this  volume.  At  high  In  compositions  (^20%),  high  quality  and 
very  thick  {~2um)  InGaAs/GaAs  quantum  wells  have  been  achieved  through  growth  on  pre- 

pattemed  substrates  and  are  expected  to  be  useful  for  high  contrast  ratio  modulators  near  lum. 
This  work  was  supported  by  the  .AFOSR,  URI(AFOSR),  DAPR.A  and  ONR. 
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High  contrast  ratio  InGaAs  based  asymmetric  Fabry-Perot  reflection  modulators,  realized  in 
both  conventional  and  inverted  geometry,  were  used  to  obtain  high  contrast  optically  bistable 
switches  in  various  schemes  utilizing  both  “normally  on”  and  “normally  off"  types  of 
modulators.  The  flexibility  provided  by  the  inverted  structure  in  realizing  novel  optical  and 
electrical  interconnections  is  demonstrated. 


I 


INTRODUCTION 


I 


mmetric  Fabry-Perot  (ASFP)  light  modulators  are  of 
iterest  for  their  high  contrast  ratio  and  large  throughput. 

Ke  growth  of  InGaAs/GaAs  based  ASFP  modulators  has 
;n  especially  challenging  since  they  require  not  only  the 
igh  reflectivity  Bragg  mirrors  and  very  accurate  (  <  1%) 
uclcness  control,  but  also  growth  of  high  quality  strained 
■Itiple  quantum  wells  (MQW).  Based  on  our  using  pre- 
Rus  success  in  the  growth  of  such  InGaAs/GaAs  MQW 
iflection  high-energy  electron  diffraction  (RHEED)  de- 
■mined  growth  conditions,'*^  we  have  realized  InGaAs 
^ed  ASFP  modulators  with  contrast  ratio  as  high  as  66: 1 
nd  30%  attendant  throughput.^’**  The  advantages  of  In- 

IAs  based  ASFP  modulators  on  GaAs  substrates  have 
n  particularly  afforded  in  the  inverted  geometry*’*  in 
ich  light  passes  in  and  out  through  the  substrate.  Such  a 
eometry  allows  replacing  the  growth  of  the  high  reflectiv- 

I Bragg  mirror  ( ~2.5  ftm  thick)  by  an  externally  depos- 
i  mirror  as  well  as  postgrowth  tuning  of  the  Fabry- 
erot  cavity.  This  geometry  also  allows  hybrid  integration 

Ith  Si  circuitry  through  flip-chip  bonding.  In  addition,  in 
[  efforts  towards  all  III-V  integration,  such  a  geometry 
ords  simultaneous  optical  access  from  both  sides  of  the 
l^ulator/detector  chip. 

■The  objective  of  this  article  is  to  explore  realization  of 
Rh  contrast  bistable  switches  using  both  “normally  on” 
nd  “normally  off”  ASFP  modulators,  especially  in  the 
■e/'/ed  geometry.  For  “normally  on”  modulators,  bistable 
ptching  has  been  demonstrated  in  the  monolithic  opto- 
lectronic  transistor  (MOET)  configuration  proposed^  by 
Ijlliamson  and  co-workers.  In  this  configuration  [Fig. 
■i)],  a  tunneling  diode  and  a  field  effect  transistor  (FET) 
R  used  with  two  pixels  of  the  modulator/detector  chip  to 
btain  bistability  and  gain.  For  “normally  off”  modulators, 
itable  switching  has  been  achieved  in  various  self-electro- 
Ric  device  (SEED)  configurations.*  In  the  usual  SEEDs, 
le  negative  differential  resistance  (NDR)  required  for 

Itable  operation  is  based  on  physical  effects  that  cause 
luction  in  the  absorption  coefficient  at  the  operating 
'avelength  due  to  an  exitonic  red  shift  under  an  applied 
lectric  field.  By  contrast,  in  the  present  work  the  NDR  is 
■ained  from  the  blue  shift  of  the  Fabry-Perot  mode  due 


to  the  reduction  of  refractive  index  near  the  exciton  shoul¬ 
der.  Additionally,  we  show  that  it  is  possible  to  integrate  a 
binary  phase  Fresnel  microlens  array  with  the  inverted 
spatial  light  modulators  (SLM)  for  lensiess  free  space  op¬ 
tical  interconnection. 

II.  EXPERIMENTAL 

The  samples  were  grown  in  the  USC  RIBER  32P  solid 
source  molecular-beam  epitaxy  (MBE)  system  under 
RHEED  determined  growth  conditions.,^  The  conventional 
ASFP  modulator  consists  of  a  17.5  period  GaAs(654 
A)/AlAs(784  A)  Bragg  Mirror,  a  50  period 
Ino.„Gaog»As(100  A)/GaAs(125  A)  MQW,  and  a  5  pe¬ 
riod  p  Bragg  mirror  grown  on  a  n  ^  GaAs  substrate.  The 
inverted  modulator  consists  of  a  5.5  period  n  Bragg  mir¬ 
ror,  the  same  MQW  as  above,  and  an  externally  deposited 
Au  mirror,  grown  on  a  semi-insulating  GaAs  substrate. 
The  growth  temperatures  for  the  first  mirror  and  the 
MQW  were  600  and  545  'C,  respectively,  in  both  the  struc¬ 
tures  and  555  *C  for  the  second  mirror  in  the  conventional 
structure.  The  growth  rates  were  0.76  and  0.60  monolayer 
(ML)/s  for  the  GaAs  and  AlAs  in  the  mirror,  and  0.26 
and  0.29  ML/s  for  the  GaAs  and  InGaAs  in  the  MQW. 
The  resonant  tunneling  diode  (RTD)  consists  of  a  triple¬ 
well,  double-barrier  structure  with  two  16  ML 
Ino25Gao75As  spacers,  two  10  ML  AlAs  barriers,  and  one 
18  ML  In,,  25Gao  75 As  well.’ 


III.  RESULTS 

The  modulators/deiectors  used  in  the  MOET  switch 
[Fig.  1(a)]  are  two  pixels  of  the  InGaAs/GaAs  conven¬ 
tional  ASFP  modulator  array.  The  modulator  pixel  pro¬ 
duces  a  contrast  ratio  of  20:1,  and  a  reflectivity  change  of 
44.4%  at  14  V  reverse  bias  when  the  incident  light  power 
is  below  0.4  mW.  When  the  incident  power  is  increased,  a 
higher  voltage  is  required  to  compensate  for  the  voltage 
drop  across  the  contact  due  to  increased  photocurrent.  The 
InGaAs/AlAs  RTD  of  Sec.  II  and  a  commercial  Si-FET 
were  used.  The  12  pm  x  12  ^im  cross-section  RTD  shows  a 
peak  current  =  0.6  mA  and  a  valley  current  of  /,  =0.2 
mA.  Details  of  the  RTD  and  the  modulator  characteristics 
are  reported  in  Refs.  ^  and  10.  respectively.  Figure  1(b) 
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Input  Power  (0.2  mW/div) 

(b) 


Fig.  1.  (a)  Circuit  diagram  of  the  monolithic  optoelectronic  transistor 
configuration  (after  Ref.  4).  (b)  Optical  switching  performance  of  the 
MOET  for  supply  a  voltage  =  20  V  and  =  —  1  V,  The  lowest 
trace  corresponds  to  the  zero  output  intensity  level. 


shows  the  input/output  characteristics  of  this  switch.  At 

V  ^  =  20  V  and  V  ^  =  —  1  V,  a  contrast  ratio  of  20:1  and 
a  A/?  of  44.4%  are  obtained,  the  same  as  the  modulator 
pi.xel  employed.  A  fan-out  of  2  is  realized.  The  fan-out  can 
be  significantly  increased  by  reducing  the  peak  current  of 
the  RTD.  Recently,  we  have  successfully  decreased  the 
peak  current  by  an  order  f  magnitude  by  reducing  the 
RTD  cross  section  to  5  /xmx5  fim  and  increasing  the 
RTD  barrier  thickness.  Moving  towards  an  all-(III-V) 
MOET.  we  have  also  examined  the  use  of  a  GaAs  metal- 
semiconductor  field  effect  transistor  (MESFET)  in  place 
of  the  Si-FET.  A  contrast  ratio  of  4:1  is  realized  at  the  12 

V  maximum  bias  that  the  MESFET  could  sustain.  Further 
improvement  requires  redesign  of  either  the  GaAs  FET  for 
higher  voltage  operation  or  the  modulator  for  lower  volt¬ 
age  operation. 

Turning  to  SEEDs,  “normally  off”  pixels  of  modulator 
of  Sec.  11  were  employed  in  conjunction  with  a  Si  photo¬ 
diode  (D-SEED),  Si  phototransistor  (T-SEED),  or  an¬ 
other  modulator  pixel  (S-SEED).  The  pixel  (10.10)  has  a 
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Fig.  2.  (a)  Photocurrent  behavior  of  pixel  (10,  10)  as  a  function  of  the 
applied  reverse  bias  with  ISO  /rW  of  illumination  at  9484  A.  (b)  The 
D-SEED  switching  characteristics  of  pixel  ( 10,  10).  The  light  beam  to  the 
modulator  was  scanned. 

FP  mode  at  9484  A  with  a  reflectivity  of  —0.4%  at  zero 
bias.  At  20  V  reverse  bias,  the  FP  mode  blue  shifts  to  945 1 
A  and  the  reflectivity  at  9484  A  increases  to  —20%.  This 
is  manifest  in  the  photocurrent  /-F  characteristics  ob¬ 
tained  under  9484  A  illumination  [Fig.  2(a)],  which  shows 
onset  of  NDR  at  5  V.  Figure  2(b)  shows  the  switching 
input/output  characteristics  of  the  pixel  using  a  Si  photo¬ 
diode  as  the  load.  A  contrast  ratio  of  —50:1  and  a  AR  of 
—  20%  is  obtained  for  this  D-SEED. 

To  compare  the  characteristics  of  the  D-SEED  and  the 
T-SEED,  Figs.  3(a)  and  3(b)  show  the  photo  /-F  char¬ 
acteristics  of  the  Si  photodiode  and  the  Si  phototransistor 
used  in  conjunction  with  modulator  pixel  (11,11).  To  ob¬ 
tain  the  same  current  output  from  the  photodiode  and  the 
phototransistor,  the  light  power  to  the  phototransistor  was 
reduced  by  a  factor  of  200.  Note  that  the  photodiode  has  a 
larger  differential  resistance  than  the  phototransistor  under 
reverse  bias.  This  difference  is  reflected  in  the  switching 
input/output  characteristics  of  the  D-SEED  and  the  T- 
SEED,  shown  in  Figs.  3(c)  and  3(d),  respectively.  The 
D-SEED  has  a  narrower  hysteresis  but  well  maintained  on 
and  off  states.  The  hysteresis  in  the  T-SEED  is  much  wider 
and  has  a  well  maintained  logic  state  only  in  its  central 
region.  The  contrast  ratios  of  the  D-SEED  and  the  T- 
SEED  are  nearly  the  same.  Though  not  shown  here,  two 
modulator  pixels  were  used  to  construct  the  first  S-SEED 
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FiC.  3.  Panels  (a)  and  (b)  show  the  photocurrent  I-V  characteristics  of 
a  UDT  PIN-SD  Si  photodiode  and  Archer  SOP  8403-301  Si  phototrans¬ 
istor.  Note  that  in  order  to  obtain  the  same  current  output  from  the 
photodiode  and  the  phototransistor,  the  light  power  to  the  phototransis- 
toi  was  reduced  by  a  factor  of  200.  Panels  (c)  and  (d)  show  the  corre¬ 
sponding  D-SEED  and  T-SEED  switching  characteristics  for  pixel  (II, 
1 1 ).  The  beam  to  the  modulator  was  scanned. 


based  on  the  inverted  ASFP  modulator.  A  bistable  switch¬ 
ing  contrast  ratio  of  S:1  was  achieved,  comparable  to  that 
reported  for  the  unstrained  GaAs/AlGaAs  MQW  based 
S-SEEDs.  With  improved  device  processing  this  figure  can 
be  improved  to  as  good  as  that  of  the  D-SEED.  Recently 
dir<^:t  growth  of  a  heterojunction  phototransistor  on  the 
inverted  InGaAs  ASFP  modulator  to  achieve  a  monolith- 
ically  integrated  strurtur.  has  also  been  successfully 
realized.  ‘ ' 

Finally,  we  turn  to  the  fabrication  of  binary  phase 
Fresnel  microlenses  for  integration  with  the  modulator  ar¬ 
ray  in  the  inverted  geometry.  Figure  4(a)  shows  an  array 
of  nine  sets  of  binary  phase  Fresnel  lenses  fabricated  using 
focused  ion  beam.  These  len  es  are  a  simplification  of 
Fresnel  lens  and  consist  of  alternating  concentric  zones  in 
two  discrete  plateaus  rather  than  a  continuous  manner.'^ 
The  center  to  center  distance  between  lenses  in  the  picture 
is  37  /xm  in  both  directions.  An  array  of  focused  spots  was 
clearly  seen  on  the  first  order  focal  plane  [Fig.  4(b)].  Fur¬ 
ther  studies  regarding  the  design  of  the  lens,  the  transfer 
efficiency,  the  intensity  profile  at  the  focal  plane,  stray  light 
level,  and  cross  talk  between  the  lenses  are  in  progress.'^ 

In  conclusion,  high  contrast  ratio  optically  bistable 
switches  are  demonstrated  using  strained  InGaAs  based 


Fig.  4.  (a)  NomaixlU  micrograph  of  an  array  of  nine  sets  of  binary  phase 
Fresnel  lenses  made  via  focused  ion  beam,  (b)  Photc^raph  of  the  incideni 
light  taken  at  the  focal  plane  of  the  lenses. 

ASFP  modulators  in  both  "normal”  and  “inverted”  geom¬ 
etry  and  for  both  "normally  on"  and  “normally  off”  types 
of  modulators.  The  flexibility  of  the  inverted  modulator 
structure  is  explored  and  the  possibility  of  integrating  a 
microlens  array  and  SLM  is  demonstrated. 
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The  spatial  distribution  of  the  long-wavel'ngth  luminescence  in  thick  lno2L>aogAs/GaAs 
multiple  quantum  wells  (MQWs)  grown  cn  patterned  and  unpatterned  substrates  has  been 
investigated  using  cathodoluminescence  (CL)  imaging  and  spectroscopy.  By  spatially 
correlating  the  luminescence  arising  from  the  MQW  exciton  recombination  (A=:950  nm)  with 
the  longer  wavelength  ( lOOOSA ^  1200  nm)  luminescence  arising  from  the  defect-induced 
recombination,  we  demonstrate  that  it  is  possible  to  determine  the  regions  of  h  ighest  film  quality 
in  both  the  mesa  and  valley  regions  for  growth  on  patterned  GaAs  substrates.  The  present 
approach  enables  a  judicious  determination  of  the  optimal  regions  to  be  used  for  active  pixels  in 
InGaAs/GaAs  devices.  For  growth  on  unpattemed  substrates,  the  CL  spectra  show 
defect-induced  broad  bards  between  1000  A  £  1600  nm.  These  bands  exhibit  spatial  variations 
which  correlate  with  the  dark  line  defects  (DLDs)  observed  in  the  A  =  950  nm  exciton 
luminescence  imaging.  Transmission  electron  microscopy  showed  that  [1 10]-oriented  misfit 
dislocations  occur  primarily  at  the  substrate-to-MQW  interface.  The  large  spatial  variation  of 
the  luminescence  intensities  indicates  that  the  DLDs  observed  in  CL  images  are  caused  by  the 
presence  of  nonradiative  recombination  centers  occurring  in  the  MQW  region  located  above  the 
interface  dislocations.  This  study  provides  new  information  describing  the  origin  and  nature  of 
DLDs  and  differs  from  previous  models,  which  have  regarded  the  electronic  nature  of 
dislocation  cores  as  the  primary  mechanism  for  inducing  DLD  radiative  contrast  in 
luminescence  imaging  of  strained  InGaAs/GaAs. 


^TRODUCTION 

sently,  there  is  a  considerable  interest  in  using  selective- 
I  epitaxial  growth  approaches  to  achieve  the  fabrication 
hick  coherent  films  of  strained  InGaAs  and  related 
tiple  quantum  well  structures  (MQW)  structures  on 
Vs  substrates.  Many  applications  in  the  area  of  optical 
iputing,  communication,  and  detection  will  benefit 
n  the  continuing  advancements  made  in  the  molecular- 
n  epita.xial  (MBE)  growth  techniques  of  highly 
ined  InGaAs  films.  Previous  studies  employing  trans- 
iion  electron  microscopy  (TEM).'"*  photolumines- 
:e  (PL),''  cathodoluminescence  (CL),*  optical 
irption,''^  and  micro-Raman, ■*  have  demonstrated  a 
itantial  reduction  in  the  density  of  misfit  dislocations 
jrring  in  thick  In^Ga,  ^  ^As  films  and  MQWs  grown  on 
Vs  mesas  with  dimensions  between  1  and  -^00  ^m. 

'he  technique  of  CL  microscopy  enables  the  measure- 
it  of  luminescence  behavior  with  a  scale  of  approxi- 
ely  micron  or  less;  this  is  quite  valuable  for  the  optical 
■acterization  of  Itl-V  semiconductor  device  arrays 
ing  active  pixels  with  such  dimensions.  The  spatial  dis- 
jtion  of  the  near-infrared  (900  5;/.  S  1600  nm)  lumi¬ 


nescence  in  thick  Ino2GaogAs/GaAs  MQWs  grown  on 
patterned  and  unpatterened  substrates  has  been  examined 
with  CL  imaging  and  spectroscopy.  We  show  that  by  spa¬ 
tially  correlating  the  luminescence  arising  from  electron- 
to-heavy  hole  (e-hh)  exciton  recombination  in  the  MQWs 
with  the  longer  wavelength  luminescence  arising  from  the 
defect-induced  recombination,  it  is  possible  to  determine 
the  regions  of  highest  film  quality  in  both  the  mesa  and 
valley  regions  for  patterned  InGaAs/GaAs  MQW  struc¬ 
tures.  The  effects  of  cation  m.gration  near  faceted  sidewalls 
of  chemically  etched  rectangular  mesas  are  examined. 

For  growth  of  MQVx's  on  unpatterned  substrates,  the 
long-wavelength  luminescence  features  are  found  to  corre¬ 
late  spatially  with  the  dark  line  defects  (DLDs)  seen  in  the 
imaging  of  the  MQW  exciton  recombir  ition  at  A~950 
nm.  Plan-view  and  cross-sectional  TEM  show  that  misfit 
dislocations  in  the  samples  are  confined  to  the  region  near 
the  interface.  The  luminescence  intensities  from  various 
MQW  structures  are  quantified,  and  a  large  variation  in 
intensity  of  the  lummescence  associated  with  the  DLDs 
suggest  that  most  of  the  QWs  in  the  vicinity  of  the  DLDs 
are  affected  by  the  misfit  dislocations.  This  int,.nsity  vari- 
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ation  is  found  to  be  relatively  indq)endent  of  the  electron 
beam  probing  depth,  and  indicates  that  the  QWs  are  af¬ 
fected  homogeneously  throughout  all  the  layers,  in  con¬ 
trast  to  the  presoice  of  misfit  dislocations  which  occur  only 
at  the  MQW-to-substratc  interface.  This  luminescence  be¬ 
havior  is  attributed  to  a  relative  increase  in  the  number  of 
nonradiative  recombination  centers  occurring  in  regions 
above  the  interface  dislocations.  These  results  suggest  that 
nonradiative  point  defects  above  the  interface  dislocations 
are  left  in  the  wake  of  dislocation  propagation  and  multi¬ 
plication.  In  the  Si  system  Kimerling  and  co-workers,^  us¬ 
ing  deep-level  transient  spectroscopy,  have  speculated  that 
it  is  the  point  defects  gettered  around  dislocations  which 
are  largely  responsible  for  defect  induced  midgap  states 
and  the  associated  recombination  properties.  Similarly,  we 
demonstrate  that  a  Cottrell  atmosphere  of  point  defects 
surrounding  dislocations  is  the  most  reasonable  explana¬ 
tion  for  the  DLD  behavior  observed  in  this  study  of 
Ino  jGao  gAs/GaAs  MQW  structures. 

II.  EXPERIMENT 

MQW  samples  were  grown  by  MBE  using  standard  In, 
Ga,  and  As  sources.  Our  study  of  growth  on  unpattemed 
substrates  involves  65  A  thick  Ino2GaogAs  QWs  sur¬ 
rounded  by  GaAs  barriers.  In  samples  designated  D92, 
D18,  and  D38,  44  periods  of  Itio  jOaiogAs  .MQWs  were 
grown  with  barrier  thicknesses  of  780,  400,  and  115  A, 
respectively.  In  sample  D 179,  a  65  A  MQW  structure  hav¬ 
ing  100  and  1230  A  barriers  with  14  periods  (28  QWs) 
were  grown.  A  control  sample,  D155,  was  grown  with  one 
65  A  thick  QW.  One  of  the  samples,  D92,  was  periodically 
8  doped;  Be  and  Si  doping  planes  with  concentrations  of 
9.0 X  10‘^  and  3.0 X  lO'^  cm  “  ^  were  inserted  at  the  centers 
and  290  A  off  centers  of  the  GaAs  barriers  to  produce  a 
nipi  effect.  This  sample  was  grown  originally  for  the  pur¬ 
pose  of  studying  suitable  structures  for  optically  addressed 
spatial  light  modulators.®  The  spatial  separation  of  elec¬ 
trons  and  holes  enables  a  large  optically  induced  absorp¬ 
tion  modulation  for  this  structure;  this  has  been  recently 
reported.®  '’  For  patterned  growth,  a  n  “  -type  Ga.A.s(  100) 
substrate  was  patterned  using  conventional  optical  lithog¬ 
raphy  and  wet  chemical  etching. The  cp.stallographi- 
cally  selective  etch  results  in  undercut  and  inclined  side- 
wall  planes  [mostly  {111}  with  some  high  index  facets)  at 
the  mesa  edges.  The  resulting  mesas  had  dimensions  of 
about  2  pm  thick  X  16  ^mX  18  pm  with  a  40  um  spacing  in 
both  directions  before  MBE  growth.  A  scanning  electron 
micrograph  (SEM)  image  of  a  typical  mesa  is  shown  in 
Fig.  1.  About  half  the  wafer  was  left  unpattemed  in  order 
to  allow  for  e.xamination  of  a  reference  sample.  The  struc¬ 
ture  grown  on  top  consisted  of  a  100  period  la.  ;Gao  sAs 
(80  .A.)/Ga.A.s(  160  A)  MQW’  (having  ~2.38  um  total 
thickness)  with  a  5000  A  p  " -type  Ga.'Xs  capping  layer. 
Cross-sectional  TE.M  (XTEM)  and  absorption  results  of 
this  sample  (designated  RG891110)  have  beer,  previously 
described.'  '  CL  measurements  were  performed  with  a 
JEOL  840-F  fieid-emission  SE.M  at  the  Jet  Propulsion 
Laboratory.'"  North  Coast  EO-817L  Ge  detector 

was  used  to  measure  the  signal  dispersed  b>  a  0  25  m 
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Fig.  I.  SEM  of  a  typical  mesa  of  the  IngjGaoiAs/GaAs  MQW  structure. 
The  dashed  and  solid  lines  with  arrows  oriented  along  (1 10)  and  (1 10). 
respectively,  indicate  the  position  of  the  electron  beam  during  line  scans 
Points  A-E  indicate  beam  position  during  local  CL  spectroscopy. 

monochromator.  An  electron  beam  current  of  0.5  nA  at 
accelerating  voltages  ranging  from  10  to  40  kV  was  used  to 
probe  the  sample  for  the  CL  measurements.  The  sample 
was  maintained  at  a  temperature  of  ~77  K. 

III.  RESULTS  AND  DISCUSSION 
A.  ln0.2Gao.sAs  growth  on  patterned  GaAs(IOO) 

A  SEM  image  of  the  structure  showing  a  220  pmX  170 
^m  area  is  presented  in  Fig.  2(a).  Scanning  monochro¬ 
matic  CL  images  corresponding  to  the  same  region  are 
shown  in  Figs.  2(b)-2(d)  for  wavelengths  of  950,  1040, 
and  1 120  nm,  respectively.  Regions  of  increasing  lumines¬ 
cence  signal  are  represented  by  areas  of  lighter  shades.  The 
image  with  =  950  nm  [in  Fig.  2(b)]  shows  a  hatched 
region  of  bright  luminescence  with  maximum  intensity  in 
the  valley  regions  midway  along  the  [010]  diagonal  (dotted 
line  in  Fig.  1 )  between  mesa  centers.  There  is  some  observ¬ 
able  emission  from  the  centers  and  edges  of  the  mesas  at 
this  wavelength.  A  dark  halo  region  is  found  to  surround 
the  mesas,  where  little  emission  is  detected.  As  the  wave¬ 
length  is  increased  to  1040  and  1 120  nm,  the  images  in  Fig. 
2  show  an  enhanced  emission  from  the  centers  and  sides  of 
the  mesas  relative  to  the  valley  regions.  Also,  for  these 
wavelengths  (and  for  other  CL  imaging  wa\elengths  in  the 
region  1000  <7. <1200  nm  not  shown  here),  the  lumines¬ 
cence  patterns  on  top  of  most  of  the  mesas  show  a  distinct 
dumbbell  shape  with  maximal  intensity  near  the  mesa 
edges  parallel  to  [iTO],  as  seen  in  Figs.  2(c)  and  2(d).  The 
dumbbell  shape  on  the  mesas,  as  seen  in  the  /.  >1000  nm 
CL  images  and  the  [1 10]  CL  line  scan,  indicates  the  pres¬ 
ence  of  interfacet  cation  migration  in  which  the  In  migrates 
up  along  the  inclined  edges  The  presence  of  contiguous 
low  inde.v  planes  is  known  to  cause  interfacet  migration: 
this  behavior  has  been  observed  to  cause  compositional 
variations  in  the  .AlGa.-Xs  anil  lnG:i.\^  lilms  grown  on  pat¬ 
terned  substrates' 
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(a)  SEM  of  RG891110 


(c)  X  =  1040  nm 


Fig.  2.  SE.M  (a)  and  ^caIlnJ^g  motuvhroniaiii;  cathodoluminescence  images 
respectuels)  of  the  -..ime  region  of  the  In,, -Ga,. ,.\s/Ga.-\s  MQW  structure 


Previously.  TEM  measurements  of  the  mesa  region  tor 
RG891110  has  shown  that  the  linear  defect  density  to  be 


less  than  ~  10'  cm 


Since  the  ma.ximum  spatial  resolu¬ 


tion  in  CL  is  limited  by  the  carrier  dilTusion  length  (  ~  1 
pm  for  G'd.\s).  we  do  not  e.xpect  to  see  the  effects  of  indi¬ 
vidual  dislocations  It  will  be  shown  below  (in  Sec.  Ill  B) 
that  for  the  D-series  tinpailerned  samples,  the  presence  ol 
underlvini’  misfit  dislocations  at  the  M()\\ -to-substrate  in¬ 
terface  !:ive>  ri'-c  lo  .i  crosshatched  pattern  when  the  linear 
defect  densiis  ic  le^c  than  —  10'  cm 

Locali.'.ed  Cd.  spectra  are  shown  m  Fie.  locations 
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[(b).  (c).  and  (d)  correspond  lo  wavelengths  of  950,  1040.  and  1120  nm. 


A-E  correspond  to  points  on  a  mesa  center,  mesa  edge, 
valley  region  near  mesa,  midway  between  mesas  along 
[110],  and  midway  between  mesas  along  [010],  respec¬ 
tively.  as  labeled  in  Fig.  1.  The  spectrum  labeled  Ref.  cor¬ 
responds  to  an  unpatterned  region  and  does  not  e.shibit  a 
measurable  e.xciton  luminescence.  The  peak  at  A  =  950  nm 
corresponds  to  the  e-hh  e.xciton  luminescence.  The  regions 
e.shibiiing  the  largest  defect-induced  luminescence 
(  I()(K)  S  A  S  1 2(X)  nm  )  on  the  mesa  are  found  in  close  pro.\- 
imitv  to  the  inclined  regions  at  the  mesa  edges  (position  B 
III  Lie.  i )  which  rexult  from  th.e  crystallographic  selectivity 


locations 
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Fig.  3.  Localized  CL  spectroscopy  at  the  points  A-E  shown  In  Fig.  I. 
The  sample  labeled  Ref.  corresponds  to  the  spectnim  taken  in  the  non- 
pattemed  region  of  the  Itig  ^Gao  (As/GaAs  MQW  structure. 


of  the  etch.  The  region  possessing  the  highest  film  quality 
has  been  determined  by  measuring  the  exciton-to-defect 
emission  ratio  for  a  variety  of  positions  in  the  structure. 
The  data  show,  surprisingly,  that  the  highest-quality  film 
exists  in  the  valley  regions  which  are  midway  between  the 
mesa  centers  (location  E  in  Fig.  1)  and  not  at  the  mesa 
centers  (location  A  in  Fig.  1).  These  results  are  expected 
to  influence  current  thinking  with  regard  to  the  design  and 
processing  of  pixelated  devices. 

B.  Growth  of  Ino  jGaogAs  on  unpatterned  GaAs(IOO) 

TEM  was  performed  on  the  D38,  D18,  D92,  and  D179 
samples.  The  MQW  regions  showed  well-ordered 
InojGa^j  gAs  and  GaAs  which  were  visibly  free  of  struc¬ 
tural  defects.  Only  at  and  below  the  MQW-to-substrate 
interface  did  there  appear  to  be  disorder  in  the  form  of 
interface  misfit  dislocations  and  looping  dislocations  that 
propagated  into  the  GaAs  substrates.  Furthermore,  by  pro¬ 
gressively  thinning  away  the  back  side  of  the  substrates,  we 
determined,  using  plan  view  TEM,  that  the  misfit  disloca¬ 
tion  cores  were  confined  soley  to  the  MQW/GaAs  interfa¬ 
cial  region. 

CL  images  of  D38,  D18,  D92,  D155,  and  D179  are 
shown  in  Figs.  4(a)-4(e).  The  wavelength  used  for  the 
imaging  was  nm  and  corresponds  to  the  e-hh  tran¬ 

sition  in  the  MQW  samples.  The  DLDs  are  evident  in  each 
of  the  images  except  for  D155,  the  single  QW  sample  [Fig. 
4(d)].  The  absence  of  misfit  dislocations  in  the  single  QW 
sample  is  to  be  expected  since  the  nominal  critical  thick¬ 
ness  of  a  single  Ino2GaogAs  layer  grown  on  GaAs(lOO)  is 
—  1 50-200  A.'*  These  images  showing  DLDs  here  are  sim¬ 
ilar  to  luminescence  images  of  thick  InGaAs  films  previ¬ 
ously  studied.'’ 

In  order  to  further  assess  the  luminescence  properties, 
we  e.samined  CL  spectra  of  the  unpatterned  InGa.As  sam- 


(d) 
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Fig.  4.  Scanning  moncchromaiic  CL  images  of  the  e-hh  e.xcitor.  lumines¬ 
cence  for  A =950  nm.  The  sample  number  and  MQW  structure  are  de¬ 
scribed  in  (a)-(e).  The  20 /rm  scale  indicated  in  (a)  also  represents  the 
scale  for  (b)-(d):  note  the  factor  of  2  scale  change  in  (e)  A  dashed  line 
along  (110)  in  (e)  indicates  the  electron  beam  path  during  the  hne  scan 
measurements  performed  for  D179 


pies  in  the  wavelength  range  900  5  4  5  1600  nm.  The  spec¬ 
tra  (not  shown  here)  are  similar  to  that  shown  in  Fig.  3  for 
the  growth  on  the  patterned  substrates;  four  distinct  fea¬ 
tures  in  the  luminescence  were  observed  centered  at  950, 
1250.  1380,  and  1460  nm.  The  peak  at  ?.  =  950  nm  is  the 
e-hh  transition.  Joyce  et  al.}  '  using  PL,  have  observed 
emission  in  the  1000  5/1  5  loOO  nm  range  for  thick  InGa.As 
films  grown  on  Ga.As.  These  emissions  were  found  to  in- 
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Fig.  S.  cl  line  scans  (intensity  vs  position  histoj^ram)  '  r  MQW  sample 
Id179.  The  electron  beam  position  is  indicated  in  Fig.  4|  The  peaks  and 
Ivalleys  in  the  A  =  9S0  nm  scan  of  the  e-hh  exciton  luminescence  corre- 
utes  with  the  peaks  and  valleys  of  the  longer  wavelength  1230,  1380,  and 
1460  nm  scans- 


crease  markedly  when  the  In^  ,7GaQ  g3As  layer  thicknesses 
^xceeded  the  critical  thickness,  and  Joyce  et  al.  concluded 
Khat  this  behavior  was  due  to  an  increase  in  interface  de¬ 
fects,  such  as  misfit  dislocations,  beyond  the  critical 
thickness.'^  In  order  to  test  this  hypothesis,  we  have  deter- 

tnined  the  spatial  distribution  of  all  distinct  emissions  in 
he  wavelength  range  900SA:S1600  nm.  We  have  mea¬ 
sured  intensity  versus  position  histograms  or  line  scans 

I  (see  Fig.  5)  which  quantitatively  give  the  spatial  variation 
in  luminescence  intensity  for  the  different  wavelengths. 
The  line  scans  for  wavelengths  of  1250,  1380,  and  1460  nm 
—have  essentially  the  same  spatial  variation  as  the  A  =  950 
Bim  scan,  i.e.,  peaks  and  valleys  in  the  longer  wavelength 
•scans  correspond  with  peaks  and  valleys  in  the  A  =  950  nm 
scan.  This  is  a  surprising  result  since,  from  Ref.  12,  we 

t/ould  expect  the  region  around  the  DLDs  to  yield  an 
nhanced  emission  rather  than  the  reduced  emission  at 
longer  wavelengths. 

a  The  nature  of  the  dislocation  cores  (i.e.,  electronic 
Ktructure  and  reconstruction)  in  strained  III-V  materials 
"is  known  to  effect  the  recombination  process.  The  studies 
of  Fitzgerald  et  al.*'  and  PetroflF  et  aO*  on  strained  films 

■have  shown  that  60°  dislocations  and  edge  dislocations 
have  different  nonradiative  recombination  rates  in  the  vi¬ 
cinity  of  these  dislocations.  It  is  apparent  from  the  data  of 

I  Fig.  5  that  the  nonradiative  mechanisms  responsible  for  the 
reduction  in  the  e-hh  exciton  luminescence  are  also  respon¬ 
sible  for  reduction  in  the  lOOO^ASlbOO  nm  emissions. 

I  This  results  in  longer  wavelength  DLDs  spatially  corre¬ 
lated  with  the  conventional  e-hh  exciton  DLDs.  The  origin 
of  the  1000  AS  1600  nm  emissions  is,  evidently,  not  di¬ 
rectly  caused  by  the  presence  of  misfit  dislocations  and  may 

I  be  due  to  the  presence  of  point  defects.  Similar  broad  and 
midgap  emissions  have  been  observed  in  thick  and  heavily 
strained  GaAs  films  grown  on  Si  which  are  believed  to 

■contain  interstitial,  antisite,  and  vacancy  defects.'^  In  ad¬ 
dition.  we  can  not  rule  out  emissions  due  to  the  presence  of 


Beam  Energy  (keV) 

Fig.  6.  The  normalized  standard  deviation  (u^)  of  the  e-hh  exciton 
luminescence  line  scans  Tor  MQW  samples.  The  standard  deviation  has 
been  normalized  to  the  mean  of  the  exciton  luminescence  for  each  line 
scan.  The  large  magnitude  of  the  variation  (see  discussion)  precludes  the 
possibility  of  only  one  QW  in  each  structure  being  alTected  by  a  single 
dislocation  core  at  the  MQW-to-substrate  interface. 


Ino.2G%gAs/GaAs  interfaces  (i.e.,  recombinations  involv¬ 
ing  interface  states).  Other  types  of  structural  defects  can 
also  induce  midgap  emissions.  A  particularly  striking  re¬ 
sult  in  Fig.  5  is  the  large  intensity  variation  seen  in  the  line 
scan  data.  The  variation  of  the  luminescence  in  the  A 
=  950  nm  line  scans  can  be  quantified  by  evaluating  the 
standard  deviation  of  all  points  shown  in  a  line  scan,  and 
normalizing  the  standard  deviation  with  respect  to  the 
mean.  In  order  to  reduce  the  uncertainties  due  to  variation 
in  the  signal-to-noise  caused  by  changes  in  the  lumines¬ 
cence  signal  associated  with  changing  the  excitation  con¬ 
ditions  (current  and  beam  energy)  and  different  sample 
thicknesses,  a  cubic  spline  fit  was  used  to  determine  the 
“smooth  curve”  through  each  line  scan.  The  normalized 
standard  deviation  Ov  of  fitted  A  =  950  nm  line  scans  are 
shown  in  Fig.  6  for  all  samples  with  a  beam  energy  ranging 
from  10  to  40  keV.  A  general  trend  is  observed  in  which  g,v 
is  fairly  constant  over  a  large  energy  range  and  increases 
slightly  with  a  decreasing  probing  depth  for  energies  less 
than  about  20  keV.  The  total  thicknesses  of  the  MQW 
region  of  D179,  D92,  D38,  and  D18  are  2.0,  3.7,  0.8,  and 
2.0  fim,  respectively,  and  a  given  variation  in  the  probing 
depth  will  affect  different  regions  of  each  sample.  From  the 
above  TEM  results,  we  know  that  misfit  dislocations  occur 
only  in  the  region  at  the  MQW-to-substrate  interface.  We 
should  expect,  therefore,  that  only  recombination  in  the 
QW  closest  to  the  interface  will  be  disturbed  directly  by  the 
presence  of  a  misfit  dislocation.  A  predicted  a^r  for  a  44 
MQW  structure  in  which  the  luminescence  from  one  QW 
at  the  interface  is  completely  suppressed  by  a  misfit  dislo¬ 
cation,  assuming  a  homogenous  carrier  excitation  and  a 
simple  sinusoidal  spatial  variation  in  the  luminescence,  is 
Ov  =  0.008.  This  is  at  least  an  order  of  magnitude  too  low 
to  explain  the  present  results  of  Fig.  6.  Instead  the  magni¬ 
tude  of  Ov  and  its  relative  constancy -for  each  sample  in 
Fig.  6  suggest  that  the  nonradiative  mechanisms  giving  rise 
to  DLDs  are  spread  homogeneously  throughout  all  MQWs 
in  each  sample,  and  the  effects  become  slightly  more  pro- 
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pi^poed  for  more  aurface  aensitive  probing  oonditiona. 

I  Plicae  reaulta  have  aignillcant  ramifications  concerning 
the  origin  .f  the  DLDs  aeen  in  this  and  previous  studies  of 

ft  InGa^iS  films  grown  on  GaAs.  In  previous  studies,  it 
simply  assumed  that  the  electronic  nature  of  the  misfit 
dislocations  were  responsible  for  the  creation  of  competing 
naradiative  channels.^'*  In  that  model,  nonradiative  mi- 
nBty  carrier  recombination  occurs  at  the  reconstructed 
dwocation  cores  as  a  result  of  localized  band  bending  in 
t^  vicinity  of  the  cores;  the  density  of  defect-induced 
sfles  in  the  band  gap  and  doping  concentration  will  de- 
tJHnine  local  depletion  region  sizes.  However,  the  present 
data  suggest  that  carrier  recombination  at  the  dislocation 

r;  is  a  minor  effect  in  producing  the  DLDs  observed  in 
This  can  be  proven  immediately  by  considering  the 
nipi  structure  of  sample  D92.  The  modulation  doping  will 

Sse  electrons  and  holes  to  spatially  separate  for  carriers 
ch  do  not  recombine  at  the  location  of  the  Irig jCa^  gAs 
^s,  and  this  results  in  negligible  carrier  diffusion  along 
the  (lOOJ  growth  direction.*’  Therefore,  it  is  apparent  that 

tmbination  of  carriers  which  are  generated  more  than 
MQW  period  away  from  the  dislocation  core  would  be 
negligible  in  this  model.  The  only  reasonable  explanation 

*the  behavior  of  is  as  follows.  The  existence  of  non- 

iative  recombination  centers  spread  homogeneously  in 
MQW  material  above  the  dislocations  is  the  primary 

ftse  for  the  large  variation  in  the  MQW  exciton  lumines¬ 
ce.  The  nonradiative  centers  will  evidently  compete 
h  the  longer  wavelength  1000  1600  nm  radiative 

channels,  resulting  in  the  spatial  correlation  of  DLDs  im- 

td  at  A=:950  nm  with  DLDs  imaged  at  longer  wave- 
gths  (as  illustrated  in  Fig.  5  for  line  scans).  Dislocation 
propagation  and  multiplication  such  as,  e.g.,  the 

iigen-Strunk’*  and  surface  half-loop  nucleation'^  mech- 
isms  are  expected  to  disturb  the  material  located  above 
e  dislocations.  Heavily  dislocated  materials,  such  as  that 
seen  in  the  strained  GaAs/Si  system  are  known  to  contain 

I  significant  density  of  point  defects.’*’  It  is,  therefore, 
ely  that  a  “Cottrell  atmosphere”  of  point  defects  left  in 
the  wake  of  dislocation  propagation  and  multiplication 
Mupled  with  the  concomitant  expitaxical  growth  over  the 
Kinities  of  such  propagation  are  responsible  for  the  ob¬ 
served  behavior  of  <7\. 
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The  defect  distributkNi  in  a  highly  strained  ln<uGao.|As/GaAs  multiple-quantum-well  (MQW) 
structure  grown  on  a  patterned  GaAs  substrate  is  examined  with  cathodoluminescence  imaging 
and  spectroscopy  in  the  near  infrared.  By  spatially  correlating  the  luminescence  arising  fiom  the 
MQW  exdton  recombirutkm  (Xs:9S0  nm)  with  the  longer  wavelength  ( 1000 1200  nm) 
luminescence  arising  from  the  defect-induced  recombination,  we  derrumstratc  that  it  b  possible 
to  determine  the  regions  of  highest  film  quality  in  both  the  mesa  and  valley  regions.  The  present 
approach  enables  a  judicious  determination  of  the  optimal  regions  to  be  used  for  active  pixek  in 
InGaAs/GaAs  spatial  light  modulators. 


Presently,  there  is  a  considerable  level  of  interest  in 

tising  selective-area  epitaxial  growth  approaches  to  achieve 
he  fabrication  of  thick  pseudomorphic  films  of  strained 
Inj,Ga|_j,As  and  related  multiple-quantum-well  (MQW) 
Structures  on  GaAs  substrates.  Previous  studies  employing 
transmission  election  microscopy  (TEM).'~*  photolumi¬ 
nescence  (PL),^*  cathodoluminescence  (CL),‘  optical  ab- 
^(orption,''^  and  micro-Raman^  have  demonstrated  a  sub¬ 
stantial  reduction  in  the  density  of  mbfit  dblocations 
occurring  in  thick  In^^Gai  _j,As  films  and  MQWs  grown  on 
GaAs  mesas  with  dimensions  between  1  and  400  (im.  In 

trder  to  optimize  the  procedure  by  which  to  fabricate  fimc- 
ioning  device  arrays  having  such  lateral  dimensions,  it  is 
essential  to  be  able  to  elucidate  the  optical  and  structural 

rroperties  on  the  scale  of  a  micrometer  or  less. 

In  this  letter,  we  employ  the  technique  of  cathodolu¬ 
minescence  scanning  electron  microscopy  (SEM)  to  exam¬ 
ine  spatial  and  spectral  variations  in  the  defect-induced 
Biear-infrared  luminescence  of  a  thick  MQW  structure  of 
inoi2Gao.8As/GaAs  grown  by  molecular-beam  epitaxy 
fMBE)  on  a  prepatterned  GaAs  substrate.  The  present 

Ktudy  demonstrates  the  complex  interplay  between  the 
lesa  side  walls  and  the  growth  kinetics  in  the  valley  re¬ 
gions.  These  results  are  expected  to  influence  current 

f  hinking  regarding  the  design  and  fabrication  of  spatial 
ight  modulator  device  arrays  requiring  the  highest  possi¬ 
ble  material  quality  in  isolated  pixel  regions.^'^ 
a  The  rt'^-type  GaAs(  100)  substrate  was  patterned  using 
Bx>nvcntionat  optical  lithography  and  wet  chemical  etch¬ 
ing. The  crystallographically  selective  etch  results  in  un¬ 
dercut  and  inclined  sidewall  planes  at  the  mesa  edges.  The 

I  resulting  mesas  had  dimensions  of  about  2  /xmXlb 
Xl8  nm  with  a  40-^m  pitch  in  both  directions  before 
MBE  growth.  A  SEM  image  of  a  typical  mesa  is  shown  in 

I  Fig.  1.  The  structure  consists  of  a  100-period  Ino2GaogAs 
(80  A)/GaAs  (160  A)  MQW  (having  —2.38  /xm  total 


'’Also  with  Depanmeni  of  Physics,  University  of  Southern  California. 


thickness)  with  a  S0(X)-A  p*-typa  GaAs  capping  layer. 
Cross-sectional  TEM  and  absorption  results  of  thb  sample 
(designated  RG891 1 10)  have  been  previously  described.'*^ 
Cathodoluminescence  measurements  were  performed  at 
the  Jet  Propulsion  Laboratory.*  An  electron  beam  cuirent 
of  0.5  nA  at  an  accelerating  voltage  of  40  kV  was  used  to 
probe  the  sample  for  the  CL  measurements.  The  sample 
was  maintained  at  a  temperature  of  ~77  K. 

A  SEM  image  of  the  structure  showing  a  220  ^mX  170 
fua  area  is  presented  in  Fig.  2(a).  Scanning  monochro¬ 
matic  CL  images  corresponding  to  the  same  region  are 
shown  in  Figs.  2(b)-2(d)  for  wavelengths  of  950,  1040, 
and  1120  nm,  respectively.  Rqpons  of  increasing  lumines¬ 
cence  signal  are  represented  by  areas  of  lighter  shades.  The 
image  with  A=950  nm  [in  Fig.  2(b)]  shows  a  hatched 
region  of  bright  luminescence  with  maximum  intensity  in 
the  valley  regions  midway  along  the  [010]  diagonal  (dotted 
line  in  Fig.  1 )  between  mesa  centers.  There  is  some  observ¬ 
able  emission  from  the  centers  and  edges  of  the  mesas  at 
this  wavelength.  A  dark  hale  region  is  found  to  surround 
the  mesas,  where  little  emission  is  detected.  As  the  wave¬ 
length  is  increased  to  1040  and  1 120  nm,  the  images  in  Fig. 
2  show  an  enhanced  emission  from  the  centers  and  sides  of 
the  mesas  relative  to  the  valley  r^ions.  Also,  for  these 
wavelengths  the  luminescence  patterns  on  top  of  most  of 
the  mesas  show  a  distinct  dumbbell  shape  with  maximal 
intensity  near  the  mesa  edges  parallel  to  [iTo],  as  seen  in 
Figs.  2(c)  and  2(d). 

In  order  to  further  quantify  the  spatial  distribution  of 
the  luminescence,  we  have  performed  a  set  of  line  scans 
over  the  single  mesa  which  is  depicted  in  the  SEM  image  of 
Fig.  1.  In  Fig.  3,  the  CL  intensity  for  the  three  different 
wavelengths  of  840,  950,  and  1120  nm  arc  plotted  as  a 
function  of  the  electron-beam  position  (AT)  along  the 
dashed  and  solid  lines  in  Fig.  1  which  are  oriented  along 
(1 10]  and  [iTO],  respectively.  The  arrows  in  Fig.  1  indicate 
an  increasing  AT  in  the  line  scan  which  has  its  origin  at  the 
intersection  of  the  dashed  and  solid  lines;  the  point  labeled 
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FIG.  I.  SEM  micrograph  of  a  typical  mesa  of  (he  InojCagiAs/GaAs 
MQW  structure.  The  dashed  and  solid  lines  with  arrows  oriented  along 
[110]  and  [I To],  respectively,  indicate  the  position  of  (he  electron  beam 
during  line  scans.  Points  A-E  indicate  beam  position  during  local  CL 
spectroscopy. 


A  (AJf=0)  is  located  in  the  approximate  center  of  the 
mesa. 

These  results  were  further  investigated  by  localized  CL 
spectroscopy  of  the  region  depicted  in  the  SEM  image  of 


(a)  SEM  of  RG891110  (b)  \  =  950  nm 


(c)  X  =  1040  nm  (d)  X  =  1120  nm 


ITG  2  SE.M  (a)  and  scanning  monochromatic  cathodoluminescence 
images  ((b).  (c),  and  (d)  correspond  to  waseicngihs  of  950.  1040.  and 
1120  nm,  respeciively)  of  the  same  region  of  ihe  fn,,,Ga„,.As/GaAs 
MO'V  siructure 


AX  (jtm) 


FIG.  3.  Cathodolumincscence  monochromatic  line  scans  along  Ihe  solid 
and  dashed  lines  depicted  in  Fig.  1;  SlX=0  corresponds  to  point  A  in 
Fig.  1. 


Fig.  1 .  The  electron  beam  was  fixed  at  certain  points  indi¬ 
cated  in  Fig.  1  with  tic  marks  (labeled  A,  B,  C,  D,  and  E), 
and  CL  spectra,  as  presented  in  Fig.  4,  were  taken.  Multi¬ 
plication  factors  used  to  scale  the  absolute  intensity  of  the 
spectra  arc  indicated.  The  peak  at  A  =  840  nm  is  the  result 
of  emission  from  the  GaAs  capping  layer.  The  sharp 
Gaussian-shaped  peak  at  ~9S0  nm  is  the  MQW  e-hh  ex- 
citon  transition  which  has  a  large  variation  in  intensity 
dependent  on  the  beam  position.  From  the  two  line  scans 
of  Fig.  3,  the  A =950  nm  peak  is  seen  to  maximize  at 
positions  of  AAT^^rlO  ^m  from  the  center  of  the  mesa. 
Further  inspection  of  the  CL  image  of  Fig.  2(b)  and  the 
local  spectra  in  Fig.  4  at  point  E  (  —  28  (xm  from  point  A), 
however,  indicates  that  the  regions  halfway  along  the  [010] 
diagonal  between  mesa  centers  yield  the  strongest  exciton 
peak  intensity.  A  strong  emission  in  the  lOOOiSA iS  1200 
nm  region  (in  Fig.  4)  is  observed  for  certain  points  A  and 
B  on  the  mesa;  two  broad  bands  centered  at  A  ~  1040  nm 
and  A  ~  1 1(X)  nm  are  seen.  Previously,  from  photolumines¬ 
cence  of  single  In^  pGao  gyAs  QWs  having  various  .thick¬ 
nesses,  a  long  wavelength  luminescence  was  found  in  the 
lOOOSAS  1600  nm  region.’  This  was  attributed  to  pres¬ 
ence  of  interface  defects,  such  as  misfit  dislocations  at  the 
InGaAs/GaAs  interface.  Thus,  it  is  reasonable  to  attribute 
the  longer  wavelength  CL  emission  to  the  presence  of 
structural  defects  which  have  previously  been  observed  in 
cross-sectional  TEM  of  this  sample.''^  The  difference  in 
these  two  types  of  structures  (single  QW  vs  MQW  and  In 
alloy  composition)  would  naturally  give  rise  to  differences 
in  defect  types  and  densities,  and  would  therefore  yield 
different  luminescence  spectra.  The  defect-induced  lumi¬ 
nescence  is  seen  to  be  strongest  near  AAT—  ±6  (xm  in  the 
A  =  1 1 20  nm  line  scan  of  Fig.  3,  which  is  near  the  centers  of 
dumbbell -shaped  lobes  seen  in  the  CL  images.  The  exciton 
emission  is  markedly  reduced  at  point  B  (AA'=  —9  ftm) 
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FIG.  4.  Localized  calhodoluminescence  spectroscopy  at  the  points  A-E 
shown  in  Fig.  I.  The  sample  labeled  Ref  corresponds  to  the  spectrum 
taken  in  the  nonpaitemed  region  of  the  InojGaoiAs/GaAs  MQW  struc¬ 
ture. 

which  is  near  the  edge  of  the  mesa  in  Fig.  1.  This  is  con¬ 
sistent  with  enhancement  of  the  defect-induced  emission  at 
point  B  since  a  large  presence  of  defect-related  recombina¬ 
tion  channels  are  expected  to  compete  with  the  exciton 
recombination  and  reduce  the  total  carrier  lifetime. 

The  electron  beam  was  further  positioned  within  the 
dark  halo  region  surrounding  the  mesa  (see  Fig.  2(b)]  at 
point  C  (AA'=  — 15  fim)  in  Fig.  1.  A  significant  decrease 
in  both  the  exciton-  and  defect-related  features  is  seen. 
Thus,  the  presence  of  additional  nonradiative  recombina¬ 
tion  channels  caused  by  changes  in  the  defect  distribution 
dominate  the  carrier  lifetime  in  this  region.  The  electron- 
beam  positions  D  (AX=  —22  fxm)  and  E  yield  CL  spectra 
with  a  large  increase  in  the  exciton  luminescence  and  si¬ 
multaneous  reduction  in  the  defect  feature.  Also  at  point  E, 
the  largest  exciton-to-defect  emission  ratio  was  found;  this 
indicates  the  region  with  the  best-film  quality.  The  spec¬ 
trum  labeled  Re/  in  Fig.  4  corresponds  to  a  point  on  the 
nonpattemed  region  of  the  sample.  Only  a  broad  and 
shifted  defect-related  band  is  observed  with  no  exciton 
emission.  This  is  consistent  with  the  marked  reduction  in 
the  threading  dislocation  density  in  the  patterned  region 
relative  to  the  nonpattemed  region,  as  previously  observed 
in  cross-sectional  TEM.'’^ 

The  dumbbell  shape  on  the  mesas,  as  seen  in  the 
A>1000  nm  CL  images  and  the  [110]  CL  line  scan,  indi¬ 
cates  the  presence  of  interface!  cation  migration  in  which 
the  In  migrates  up  along  the  inclined  edges.  The  presence 
of  contiguous  low  index  planes  is  known  to  cause  interfacet 
migration;  this  behavior  has  been  observed  to  cause  com¬ 
positional  variations  in  the  AlGaAs  and  InGaAs  films 
grown  on  patterned  substrates. The  wavelength  po¬ 
sitions  of  the  exciton  peak  at  points  A  and  E  are  954  and 
946  nm,  respectively.  If  attributed  exclusively  to  a  varia¬ 
tion  in  the  In  content,  this  shift  corresponds  to  an  In  com¬ 
positional  change  upper  limit  of  —0.5%. "  This  is  too 


small  a  shift  to  alone  account  tor  toe  eatreuic  ucicm- 
ioduced  luminescence  variatioas  obwrved  heic.  The  spe¬ 
cific  structural  nature  of  the  defects  responsible  for  the 
long-wavelength  luminescence  cannot  be  determined  from 
the  present  study.  The  defect-induced  emisskm  is,  however, 
likely  influenced  by  the  proximity  of  the  region  to  the 
growth  on  the  contiguous  (111)  and  higher  index  facets 
and  the  attendant  interfacct  cation  migration.  It  is  often 
desirable  to  eliminate  the  effects  of  cation  migratkm  on  the 
faceted  walls  altogether  by  growing  on  elongated  (110] 
striped  mesas.^^ 

In  conclusion,  we  have  demonstrated  the  effectiveness 
of  cathodoluminescence  imaging  and  spectroscopy  in  the 
near  infrared  in  determining  the  spatial  distribution  of  the 
defect-induced  bands  and  exciton  line  for  an  InGaAs 
MQW  grown  on  patterned  GaAs.  The  presence  of  inclined 
mesa  edges  and  cation  migration  is  seen  to  influence  the 
defect  distribution  for  this  particular  class  of  patterned 
MBE  growth.  The  presence  of  a  characteristic  dumbbell 
shape  in  the  defect-induced  luminescence  pattern  suggests 
that  these  regions  are  far  from  ideal  for  applications  in 
spatial  light  modulators.  Instead,  regions  midway  between 
the  mesa  centers  yield  extremely  large  exciton-to-defect 
luminescence  ratios.  This  suggests  that  these  regions,  with 
further  device  processing,  could  serve  as  high-conirasl-ratio 
light  modulators.^  ’ 
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Analysis  of  weighted  fan-out/fan-in  volume 
holographic  optical  interconnections 


Praveen  Asthana,  Gregory  P.  Nordin,  Armand  R.  Tanguay,  Jr.,  and  B.  Keith  Jenkins 


The  feasibility  of  employing  volume  holographic  techniques  for  the  implementation  of  highly  multiplexed 
weighted  fan.out/fan>in  interconnections  is  analyzed  on  the  basis  of  interconnection  fidelity,  optical 
'  «  throughput,  and  oompiodty  of  recording  schedule  or  implementation  hardware.  These  feasibility 

criteria  were  quantitatively  evaluated  using  the  optical  beam  propagation  method  to  numerically  simulate 
the  difliaction  characteristics  of  volume  holographic  interconnections  recorded  in  a  linear  holographic 
material.  We  find  that  conventional  interconnection  architectures  (that  are  based  on  a  single  coherent 
optical  source)  exhibit  a  direct  trade-off  between  interconnection  fidelity  and  optical  throughput  on  the 
one  hand,  and  recording  schedule  or  hardware  com|dexity  on  the  other.  In  order  to  circumvent  this 
trade-off  we  describe  and  analyze  in  detail  an  incoherent/coherent  double  angularly  multiplexed 
interconnection  architecture  that  is  based  on  the  use  of  multiple-source  array  of  individually  coherent  but 
mutually  incoherent  sources.  This  architecture  either  minimizes  or  avoids  several  key  sources  of  cross 
talk,  permits  simultaneous  recording  of  interconnection  weights  or.  weight  updates,  and  provides 
enhanced  fidelity,  interchannel  isolation,  and  throughput  performance. 


1.  Introduction 

Volume  holographic  optical  elements  (VHOE’s)  have 
often  been  suggested  as  the  principal  components  of 
an  optical  interconnection  technology  for  applications 
such  as  optical  computing  and  telecommunications 
that  require  a  large  number  of  intercormections.‘~® 
Dependhig  on  the  application,  such  interconnection 
systems  may  also  require  varying  degrees  of  intercon¬ 
nection  weighting,  fan-out,  fan-in,  and  channel  inde¬ 
pendence.  In  particular,  interconnection  systems 
for  artificial  neiiral  networks  provide  a  specific  appli¬ 
cation  for  which  all  of  these  issues  are  important. 

Artificial  neural  networks  are  composed  of  many 
highly  interconnected  nonlinear  computational  ele¬ 
ments  (neuron  units)  that  operate  in  parallel  and  are 
arranged  in  architectural  patterns  that  are  motivated 
to  a  certain  extent  by  investigations  of  biological 
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neural  networks.  The  computational  elements  are 
in  most  cases  densely  interconnected  with  weighted 
connection  pathways  that  can  be  reconfigured  and 
updated  to  permit  either  supervised  or  unsupervised 
learning.  Implementations  of  adaptive  neural  net¬ 
works  should  optimally  permit  such  pathway  recon¬ 
figuration  and  weight  updates  without  excessively 
compromising  either  hardware  complexity  or  compu¬ 
tational  efficien(7. 

Typical  artificial  neural-network  interconnection 
topologies  require  a  high  degree  of  both  fan-out  and 
fan-in  at  each  neuron  unit.  A  fully  interconnected 
topology  for  the  case  of  a  single-layer  network  is 
illustrated  schematically  in  Fig.  1,  in  which  two 
planes  of  neuron  units  are  shown.  In  such  a  fully 
connected  network,  the  output  of  each  neuron  unit  in 
the  input  plane  is  fanned  out  to  all  of  the  neuron  units 
in  the  output  plane.  Similarly,  each  neuron  unit  in 
the  output  plane  receives  the  weighted  fan-in  of  all  of 
the  neuron-unit  outputs  from  the  input  plane.  The 
fan-out/fan-in  requirements  of  neural  networks  act 
as  a  multiplier  on  the  total  number  of  interconnected 
neuron  units,  resulting  in  an  interconnection  system 
that  must  prove  capable  of  supporting  very  large 
numbers  of  independent  pathways  in  a  relatively 
compact  topology.  For  example,  a  fully  connected 
neural  network  having  10®  neuron  units  in  both  the 
input  and  output  planes  requires  10’°  interconnec¬ 
tions.  as  does  a  partially  connected  neural  network 
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ig.  1.  Schematic  representation  of  fan-out/fan-in  interoonnec- 
ions  between  input  and  output  planes  of  neuron  units. 


naving  10^  neuron  units  in  both  the  input  and  output 
^planes  with  a  fan-out  and  fan-in  of  10*. 

The  use  of  volume  holographic  optical  elements  has 
■been  proposed  as  the  basis  of  an  interconnection 
technology  for  neural  networks  precisely  betause  it 
*has  the  potential  to  meet  the  critical  requirements  of 
providing  both  large  numbers  of  interconnections 

I  and  a  weighted  fan-out/fan-in  topology.^  Even  so, 
the  feasibility  of  using  VHOE’s  for  large-scale 
weighted  fan-out/fan-in  interconnection  applications 

(depends  further  on  the  fidelity  with  which  the  inter¬ 
connection  weights  can  be  implemented  and  on  the 
optical  throughput  that  can  be  achieved  in  the  vol¬ 
ume  holographic  interconnection  system.  The  opti- 

Ical  throughput  is  a  measure  of  the  fraction  of  the 
total  incident  optical  power  that  is  difiracted  into  the 
set  of  desired  outputs;  as  such,  it  provides  a  quantita¬ 
tive  assessment  of  the  overall  interconnection  ^stem 

I  efficiency.  Important  additional  implementation  is¬ 
sues  include  the  total  number  of  required  exposures, 
the  optical  power  incident  on  the  holographic  record- 

Iing  medium  per  exposure,  the  complexity  of  the 
implementation  hardware,  the  required  exposure 
schedule  (if  any),  and  the  total  recording  time  for  a 
given  interconnection  complexity. 

I  In  this  paper  we  quantitatively  evaluate  the  perfor¬ 
mance  characteristics  of  two  distinct  volume  holo¬ 
graphic  interconnection  architectures  in- accordance 
with  these  criteria.  The  first  interconnection  archi- 

Itecture  is  novel  in  that  an  array  of  individually 
coherent  but  mutually  incoherent  optical  sources  is 
used  to  generate  a  multiplicity  of  angularly  multi- 

Iplexed  recording  beams.®"**  We  hereinafter  refer  to 
this  t5T>e  of  interconnection  architecture  as  being 
incoherent /coherent  double  angularly  multiplexed. 
The  second  interconnection  architecture  is  based  on 

I  the  use  of  a  single  coherent  optical  source  during 
recording,  as  has  been  widely  discussed  in  the  litera¬ 
ture.®'®"'®  This  conventional  architecture  is  re¬ 
ferred  to  herein  as  a  single-source  architecture. 

I  In  our  analysis  we  use  the  optical  beam  propaga¬ 
tion  method  (BPM)'*  to  investigate  the  diffrac- 


,  tion  characteristics  of  wetthCed  fan-out/fan-in  inter¬ 
connections  in  comple<c  systems  that  inwdve  both 
large  numbers  of  holographic  gratings  and  multiple 
readout  beams,  and  that  therefore  do  not  lend  them¬ 
selves  readily  to  analytical  solutions.  For  our  pur¬ 
poses  herein,  we  consider  explicitly  the  case  of  linear 
holographic  materials  in  oi  d^  to  illustrate  key  differ¬ 
ences  in  interconnection  performance  that  are  archi¬ 
tecture  dependent.  Generalization  of  these  results 
to  nonlinear  holographic  materials  (for  example,  to 
certain  photorefractive  media)  is  beyond  the  scope  of 
this  paper. 

R^ults  are  reported  herein  on  the  numerical  simu¬ 
lation  of  up  to  10-  (input  node)  to-10  (ou^nit  node) 
weighted  fan-out/fan-in  volume  holographic  intercon¬ 
nections  that  incorporate  between  10  and  190  individ¬ 
ual  hol<^raphic  gratings  multiplexed  within  the  same 
region  of  the  volume  h<dographic  recording  medium, 
depending  on  the  specific  architectural  configuration 
considered.  Such  simulations  require  signifijant  run 
times,  even  when  implemented  with  the  highly  effi¬ 
cient  BPM  algorithm  on  a  supercomputer.  To  the 
best  of  our  k^wledge,  the  l()-to-10  case  with  fully 
independent  weights  is  one  of  the  most  complex 
volume  holographic  interconnection  systems  that  ^ve 
been  analyzed  to  date. 

Our  simulations  demonstrate  that  the  novel 
inooherent/coherent  double  angulariy  multiplexed 
architecture  exhibits  both  high  interconnection  fidel¬ 
ity  and  hi^  optical  throughput  effidengr  even  in  the 
presence  of  fully  dmultaneous  recording.  This  com¬ 
bination  of  desirable  characteristics  derives  from  the 
elimination  (or  minimization)  of  several  distinct 
sources  of  interchannel  cross  talk  and  throu^put 
loss  that  are  unavoidably  present  in  the  more  widely 
investigated  single-source  architecture.  In  addition, 
the  mutual  incoherence  of  the  readout  beams  in  the 
inooherent/coherent  double  angular^  multiplexed 
architecture  provides  naturally  for  linearity  of  sununa- 
tion  in  an  intensity  representation  without  an  associ¬ 
ated  fan-in  loss. 

In  order  to  compare  the  fidelity  and  throughput 
performance  of  the  incoherent/coherent  double  angu¬ 
larly  multiplexed  architecture  with  an  appropriate 
benchmark,  we  analyzed  the  single-source  arc^tec- 
ture  (iUustrated  in  Fig.  2)  under  directly  comparable 
holographic  recording  conditions.  In  particular,  we 
examined  the  cases  of  simultaneous,  pagewise- 
sequential,  and  fully  sequential  recorffing  of  the 
desired  wei^t  updates.  These  three  recording  meth¬ 
ods  represent  distinct  trade-offs  between  interchan¬ 
nel  cross  talk  resulting  from  the  presence  of  extrane¬ 
ous  gratings,  on  the  one  hand,  and  recording  schedule 
or  hardware  complexity  on  the  other.  The  presence 
of  such  extraneous  gratings  can  cause  significant 
errors  in  the  diffracted  outputs  that  are  fanned  in  to  a 
given  interconnection  node.  For  example,  in  the 
lO-to-10  single-source  interconnection  system  that 
we  modeled,  errors  as  large  as  100%  in  the  relative 
diffracted  outputs  occur  at  the  peak  optical  through¬ 
put  of  50%  for  the  simultaneous  recording  method. 
To  the  best  of  our  knowledge,  the  analysis  of  the 
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Pif.2.  Schematic  diagram  of  mfingle-sourcc  holographic  interoon- 
ncction  architecture  in  which  diffraction  gratings  in  a  volume 
hologram  connect  pixels  in  the  input  plane  to  pixels  in  the  output 
plane.  Interconnection  gratings  are  formed  by  recording  the 
interference  between  light  from  pixels  in  the  training  plane  and 
light  from  pixels  in  the  input  plane.  Li-Lj  are  lenses;  /i-/i  are 
focal  lengths. 

simultaneous  recording  method  (as  applied  to  the 
sin^e-source  architecture)  presented  herein  estab¬ 
lishes  the  fidelity  errors  and  throughput  losses  of  this 
technique  quantitatively  for  the  first  time.  Elimina¬ 
tion  of  the  extraneous  gratings  by  using  the  fully 
sequential  recording  method  (at  a  substantial  cost  in 
recording  schedule  or  hardware  complexity)  reduces 
the  largest  errors  to  13%  at  an  increa^  pe^  optical 
throughput  of  85%.  These  errors  for  the  fully  se¬ 
quential  recording  case  are  significantly  lower  than 
those  described  in  Ref.  19,  at  least  in  part  because  the 
relative  phase  relationships  among  the  recording 
beams  are  maintained  on  readout  in  our  simulations. 

In  addition,  we  demonstrate  that  the  severe  fan-in 
loss  that  unavoidably  accompanies  the  coUinear  com¬ 
bination  of  mutually  incoherent  beams  of  essentially 
common  wavelength,*^  and  that  also  applies  to  inco¬ 
herent  readout  in  the  single-source  ar^tecture,  is 
ou<i  of  several  effects  of  beam  degeneracy  (defined  in 
Section  4.D).®'‘*  ‘*  “  This  same  physical  mechanism 
is  also  primarily  responsible  for  the  residual  errors 
appiarent  in  the  fully  sequential  recording  method  at 
p>e^  throughput. 

The  remainder  of  this  papor  is  organized  as  follows. 
Spjecific  aspocts  of  volume  holographic  interconnec¬ 
tion  systems  that  are  essential  for  the  establishment 
of  a  valid  comparison  metric  are  discussed  in  Section 
2,  including  the  choice  of  signal  representations  and 
weights  appropriate  for  photonic  neural  networks, 
the  implementation  of  weighted  fan-out/fan-in  inter¬ 
connections  in  a  single-source  architecture,  and  the 
associated  sources  of  fidelity  errors  and  throughput 
losses.  Section  3  details  our  modeling  methodology 
and  assumptions,  anticipating  the  discussion  of  the 
simulation  results  obtained  for  the  single-source 
architecture  that  are  provided  in  Section  4.  We 
present  two  distinct  configurations  of  the  incoher¬ 
ent/coherent  double  angularly  multiplexed  architec¬ 
ture  and  quantitatively  analyze  their  performance 
characteristics  in  Section  5.  Finally,  we  compare  the 
relative  advantages  and  disadvantages  of  each  archi¬ 
tectural  configuration  and  summarize  our  results  in 
Section  6. 


2.  Preliminary  ConceiiU  • 

A.  Basic  Neural-Network  OperMion 

In  many  neural  networks  the  computational  process 
of  a  single  network  layer  can  be  represented  by 

y.  =  fifii).  (1) 

s 

Pi  =  2  ''vf;.  (2) 

y-1 

in  which  neuron  units  i  and  j  are  in  the  output  and 
input  planes.  resp>ectively,  y,  is  (propmrtional  to)  the 
output  of  neuron  unit  i,  x,  is  (proportional  to)  the 
output  of  neuron  unit  j,  is  the  weight  of  the 
interconnection  between  neuron  units  i  and  j,  N  is  the 
number  of  neurons  in  the  input  plane,  f  is  the 
nonlinear  threshold  function  of  each  neuron  unit, 
and  p,  is  the  activation  potential. 

In  a  neural  network  that  incorporates  learning,  the 
weights  Wy  are  upxlated  incrementally  according  to 
an  appropriate  learning  rule  as  training  pottems  are 
presented  to  the  network.  A  large  cla^  of  learning 
algorithms  can  be  characterized  by  the  following 
weig^t-upxlate  rule; 

AW;,  =  (3) 

in  which  AWj,  =  W^{m)  —  W^{m  -  1)  is  the  weight 
update,  a  is  the  learning  gain  constant,  and  m  is  the 
iteration  index.  Various  learning  rules  can  be  formed 
by  suitably  choosing  the  functional  form  of  Sj”*’;  for 
example,  in  the  case  of  Hebbian  learning,  8|'"*  is 
chosen  to  bey^.*® 

B.  Weighted  Fan-Out/Fan-ln  Interconnections  in  a 
Single-Source  Archrtecture 

The  conventional  single-source  volume  holographic 
interconnection  architecture  as  configured  for  pho¬ 
tonic  neural-network  implementation  is  shown  sche¬ 
matically  in  Fig.  2.  This  architecture  consists  of  a 
holographic  interconnection  medium  and  three  planes 
of  neuron  units;  a  training  plane,  an  input  plane, 
and  an  output  plane.  The  two-dimensional  (2-D) 
arrays  of  neuron  units  or  pixels  (since  in  many 
photonic  neural-network  implementations  each  pixel 
correspjonds  to  a  single  neuron  unit)  are  placed  on 
these  planes.  In  applications  other  than  neural  net¬ 
works  these  pixels  can  represent  generic  connection 
nodes.  The  arrays  of  pixels  can  be  implemented 
using,  for  example,  2-D  spatial  light  modulators 
(SLM’s).  Both  the  training-  and  input-plane  SLM’s 
are  illuminated  by  a  single  coherent  source  in  the 
configuration  shown  in  Fig.  2. 

A  given  weighted  interconnection  between  a  pLxel 
in  the  input  plane  and  a  pixel  in  the  output  plane  is 
physically  realized  as  a  single  diffraction  grating  in 
the  holographic  medium.  Each  grating  is  formed  by- 
recording  the  interference  pattern  generated  by  coher¬ 
ent  superposition  of  light  from  a  pixel  in  the  input 
plane  and  light  from  a  pixel  in  the  training  plane. 
For  simplicity  the  pixels  in  both  planes  are  typically 
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s^med  td  act  as  point  sources^-"*  and  lenses  L|  and 
function  as  collimating  lenses.  Light  from  each 
txel  is  thus  incident  as  an  angularly  distinct  coUi- 
fiated  beam  (tjrpically  approximate  as  a  plane 
on  the  holographic  medium.  The  light 
rom  each  pixel  is  assumed  to  fully  illuminate  the 
perture  of  the  holographic  medium  to  ensure  the 
lotential  for  full  connectivity.’  *» 


light  beams  from  the  input-p(am>  pixels  are  mutually 
coherent  (as  is  typically  the  case  for  a  single-source 
architecture),  the  diffracted  signals  detected  in  the 
output  plane  can  be  written  (assuming  an  intensity 
representation  1  as 


P. 


.V 

5]  ^xp|i(d.,  + 

j-i 


(51 


Signal  Representation  in  a  Single-Source  Architecture 
)epending  on  the  computational  algorithm,  the  opti- 
al  signal  from  each  input  pixel  (alter  coUimation )  can 
>e  represented  by  a  number  of  different  physical 
luantities,  including  intensity,  amplitude,  polahza- 
ion,  and  wavelength.  In  the  two  most  commonly 
nvestigated  approaches’  the  signal  is  represented 
nther  by  the  complex  electric  field  or  by  the  intensity 
>f  the  light  transmitted  or  reflected  by  the  pixel 
depending  on  the  type  of  SLM). 

In  the  amplitude  representation  the  electric  field  of 
the  light  from  pixel  j  that  is  incident  at  the  front  face 
of  the  hologram  can  be  expressed  (n^ecting  the 
vector  nature  of  the  electric  field)  as 
exp(n%  •  r  -I-  i6j  -  iW),  in  which  is  the  amplitude 
transmissivity  (or  reflectivity)  of  the  jth  pixel,  Eq  is 
the  magnitude  of  the  electric  field  amplitude  of  the 
readout  beam,  <]>.  is  the  phase  of  the  readout  beam 
(relative  to  an  arbitrarily  chosen  coordinate  system), 
and  <•>  is  the  angular  frequency.  In  this  representa¬ 
tion  a  given  interconnection  weight  is  proportional  to 
the  amplitude  diffraction  efficiency  of  the  correspond¬ 
ing  interconnection  grating. 

In  the  intensity  representation  the  output  of  pixel  j 
is  expressed  as  x/o>  in  which  xj  is  the  intensity 
transmittance  (or  reflectance)  of  the  SLM  pixel  and 
/q  -  Eq.  In  this  case  each  weight  is  proportional  to 
the  intensity  diffraction  efficiency  of  its  correspond¬ 
ing  interconnection  grating. 

When  readout  is  performed  with  beams  from  more 
than  one  input  pixel,  the  intensity  detected  within 
each  output  pixel  consists  of  a  weighted  sum  of 
diffracted  signals.  If  an  optical  system  is  con¬ 
structed  such  that  the  light  beams  from  the  input 
pixels  are  mutually  incoherent  during  readout,  the 
relative  detected  intensity  (using  the  intensity  repre¬ 
sentation)  within  the  ith  output  pixel  p,  is 

Pi  =  2  ^ij^j>  (4) 

J-'i 

in  which  Wjy  is  an  intensity  weight  (which  is  propor¬ 
tional  to  the  diffraction  efficiency  of  grating  ij)}’’ 
The  relationship  expressed  in  Eq.  (4)  is  the  same  as 
that  in  Eq.  (2),  which  defines  how  the  activation 
potentials  are  related  to  the  inputs  by  the  interconnec¬ 
tion  weights  of  the  neural  network.  Thus  readout 
with  mutually  incoherent  beams  in  conjunction  with 
an  intensity  representation  leads  to  an  optical  sj'stem 
that  performs  the  desired  neural  interconnection 
function  of  Eq.  (2). 

If  the  optical  system  is  constructed  such  that  the 


in  which  is  the  phase  of  the  yth  grating.  Each 
grating  phase  is  set  by  the  relative  phases  of  the 
recording  beams  used  to  form  the  grating  and  the 
properties  of  the  holographic  recording  medium. 
If  the  argument  of  the  exponential  is  constant  for  all  i 
and  j,  the  diffracted  outputs  for  a  singie-source 
architecture  using  the  intensity  representation  re¬ 
duce  to 


P. 


(6) 


This  equation  is  valid  only  if  two  specific  conditions 
are  met:  (1)  the  readout  beams  must  have  the  same 
relative  phases  as  were  used  during  recording  of  the 
interconnection  gratings,  and  (2)  the  phase  shift 
induced  by  the  holographic  recording  medium  itself 
must  be  constant  for  all  recorded  gratings. 

Even  with  these  assumptions,  the  square-root  rela¬ 
tionship  embodied  in  Eq.  (6)  deviates  substantially 
from  the  desired  interconnection  function  of  Eq.  (2). 
The  general  effect  of  this  deviation  on  neural-network 
operation  has  not  yet  been  established. 

As  a  result,  several  authors  have  chosen  an  ampli¬ 
tude  representation  for  use  in  the  single-source  ardii- 
tecture,  such  that  the  diffracted  outputs  are  given 
by’-’* 


P.  = 


(7) 


in  which  W“'’  is  the  amplitude  wei^t  (which  is 
proportional  to  the  amplitude  diffraction  efficiency). 
If  the  argument  of  the  exponential  in  Elq.  (7)  is 
constant  for  all  i  and  j,  the  system  yields  the  square  of 
the  desired  interconnection  function,  which  may  be 
corrected  either  electronically  or  by  adjusting  the 
nonlinear  threshold  function  of  each  neuron  unit. 
This  simplification  is  dependent,  as  is  Eq.  (6),  on  the 
two  conditions  specified  above.  The  first  of  these 
conditions  is  contingent  on  the  maintenance  of  rigid 
optical  phase  stability  in  the  system,  which  may  be 
difficult  to  realize  in  practice. 

A  primary  advantage  of  readout  with  mutually 
incoherent  teams  is  the  avoidance  of  the  rigid  phase- 
stability  requirements  that  are  necessary  for  coher¬ 
ently  read  out  systems.’’*-*®  However,  a  serious 
disadvantage  associated  with  the  use  of  mutually 
incoherent  readout  beauns  is  the  significant  through¬ 
put  loss  that  is  characteristic  of  all  holographic 
interconnection  architectures  in  which  the  fan-in  is 
performed  collinearly.’*  Since  single-source  intercon- 
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«iection. sterns  rely  on  eollinear  fan-in,  they  inher¬ 
ently  suffer  from  this  throughput  loss  for  readout 
with  mutually  incoherent  beams.  In  Section  5  we 
discuss  a  method  for  using  readout  with  mutually 
incoherent  beams  without  suffering  an  incoherent 
(an-in  loss. 

A  further  consideration  in  the  implementation  of 
photonic  neural  networks  is  that  many  neural  algo 
rithms  require  the  use  of  bipolar  weights  and  bipolar 
neuron-unit  outputs.  One  method  of  achieving  this 
bipolarity  for  readout  with  mutually  coherent  beams 
is  to  use  an  amplitude  representation  in  which  a  180* 
phase  shift  in  the  phase  of  both  the  electric  fields  and 
the  diffraction  gratings  is  used  to  represent  negative 
numbers.’-^’'  \^ile  attractive  in  principle  (requiring 
only  a  sin^e  data  channel  per  neuron  unit),  this 
approach  is  difficult  to  implement  in  practice,  in  part 
bemuse  the  phases  of  the  resultant  diffracted  outputs 
must  be  detected  as  well  as  their  amplitudes.  An 
alternative  method  of  achieving  bipolarity  is  to  use  an 
intensity  representation  in  conjunction  with  a  dual- 
rril  concept  in  which  each  neuron  has  separate 
positive  and  negative  channels  for  both  input  and 
output  signals  vrith  associated  weighted  interconnec¬ 
tions  for  each  pair  of  channels.  ^*^-*****  Although 
this  method  requires  two  data  channels  per  neuron 
unit,  it  is  compatible  with  simple  square-law  detec¬ 
tors  and  does  not  require  mutually  coherent  readout 
beams. 

For  our  purposes  herein,  we  consider  only  unipolar 
weights  and  unipolar  neural  outputs  since  the  dual¬ 
rail  method  can  be  used  to  generalize  to  the  fully 
bipolar  case.  In  addition,  we  choose  to  adopt  the 
intensity  representation  throughout  in  order  to  facili¬ 
tate  direct  comparisons  between  mutually  coherent 
and  mutually  incoherent  readout  tystems.  In  partic¬ 
ular,  the  intensity  representation  is  perhaps  the  most 
natiiral  representation  for  the  incoherent/coherent 
double  angularly  multiplexed  interconnection  archi¬ 
tecture,  yielding  linear  sum  rules  in  the  diffracted 
outputs.  In  any  case,  this  choice  does  not  affect  the 
general  conclusions  drawn  regarding  interconnection 
fidelity  and  optical  throughput  For  the  various 
cases  of  mutually  coherent  readout  treated  herein,  we 
assume  that  both  conditions  (1)  and  (2)  specified 
above  [following  Eq.  (6)]  are  met. 

D.  Recording  Methods  in  a  Single-Source  Architecture 

For  the  single-source  architecture  shown  in  Fig.  2,  in 
which  the  light  from  each  pixel  is  mutually  coherent 
with  the  light  from  all  such  pixels  in  a  given  plane, 
there  are  several  methods  by  which  the  desired 
interconnection  gratings  may  be  recorded.  All  such 
methods  involve  a  sequence  of  e.\posures,  and  they 
differ  in  the  nature  of  each  individual  exposure.  The 
first  method  is  simultaneous  recording,  in  which 
mutually  coherent  light  beams  from  all  the  input  and 
training  pixels  are  incident  simultaneously  on  the 
holographic  medium  during  each  exposure.  By  re¬ 
cording  the  resultant  interference  pattern,  the  vol¬ 
ume  holographic  medium  forms  gratings  that  con¬ 


nect  each  neuron  unit  in  the  input  plane  with  each 
neuron  unit  in  the  output  p  ane.  These  constitute 
the  desired  iiUtrjUanar  connections  that  pedbrm  the 
weighted  fan-out/fan-in  function.  However,  intra- 
pUxnar  gratings  that  connect  pairs  of  pixels  within  the 
input  plane  (ai^  within  the  training  plane  as  well!  are 
formed  as  a  result  of  the  mutual  coherence  of  the 
sources. These  extraneous  gratings  (termed  co¬ 
herent-recording  cross-talk  gratings,  or  cross  grat¬ 
ings)  introduce  a  serious  source  of  cross  talk  into  the 
interconnection  system  that  in  turn  leads  to  a  loss  of 
boii.  throughput  and  reconstruction  fidelity. 

In  the  pagewise-sequential  recording  method,  a 
single  pixel  in  the  input  plane  is  connected  simulta¬ 
neously  with  all  of  its  associated  training  pixels 
during  each  exposure,  which  eliminates  the  coherent 
cross-talk  gratings  within  the  input  plane  only.  In 
the  fully  sequential  recording  method,  the  remaining 
coherent  cross-talk  gratings  in  the  training  plane  are 
eliminated  by  recording  connections  between  a  single 
pixel  in  the  input  plane  and  a  single  pixel  in  the 
training  plane  during  each  exposure.  In  order  to 
achieve  the  reduction  in  coherent-recording  cross  talk 
offered  by  these  methods,  however,  an  increase  in  the 
recording  schedule  or  hardware  complexity  must  be 
accommodated.  In  certain  photorefractive  media  the 
situation  may  be  complicate  further  by  the  need  for 
a  complex  recording  schedule  to  compensate  for  the 
partial  erasure  of  previously  recorde  interconnec¬ 
tion  gratings  during  the  recording  of  later  grat¬ 
ings.*-^* 

In  a  photonic  neural-network  implementation,  the 
interconnection  weights  can  be  either  precomputed 
(in  a  photonic  or  electronic  computing  tystem)  and 
stored  in  a  permanent  holographic  medium  for  later 
use  or  obtained  adaptively  using  a  suitable  learning 
algorithm  and  a  dynamic  holographic  medium.  For 
the  recording  of  precomputed  weights  it  is  clearly 
advantageous  to  minimize  the  total  number  of  expo¬ 
sures,  the  total  recording  time,  and  the  total  exposure 
energy.  For  a  system  in  which  N  input  nodes  are 
connected  to  N  output  nodes  (referred  to  herein  as  an 
N-io-N  interconnection  system)  the  minimum  num¬ 
ber  of  exposures  requir^  to  record  a  set  of  fully 
independent  weights  is  equal  to  N,  as  can  be  derived 
from  degrees-of-freedom  considerations.**  The  as¬ 
sumption  of  fully  independent  weights  is  equivalent 
to  the  assumption  that  the  weight  matrix  is  of  rank 
N.  If  the  full  set  of  precomputed  weights  is  recorded 
pagewise  sequentially,  therefore,  the  minimum  num¬ 
ber  of  exposures  can  be  achieved. 

For  adaptive  computation  of  weights  a  set  of  train¬ 
ing  pairs  is  present^  sequentially  (one  training  pair 
at  a  time)  to  the  network.  Each  training  pair  con¬ 
sists  of  an  input  image  and  its  corresponding  training 
image,  which  are  presented  on  the  respective  input 
and  training  planes  of  Fig.  2.  The  mth  input  image 
can  be  represented  by  the  vector  x'"",  the  components 
x'"'  of  which  are  shown  in  the  first  term  on  the 
right-hand  side  of  Eq.  (3).  Similarly,  the  mth  train¬ 
ing  image  is  represented  by  the  vector  5''"'.  Ideally, 
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le  full  input  and  training  images  for  the  mth 
aining  pair  lx*"".  6""'|  are  ;>rcseni^  simultaneously 
I  the  input  and  training  planes,  respectively,  such 
lat  only  one  exposure  is  required  for  each  training 
lir.  The  outer  product  of  tl  e  vectors  x""'  and  8'"‘  is 
corded  in  each  such  exposure,  which  thus  corre- 
K>nds  to  the  simultaneous  recording  case  discussed 
Mve.  However,  if  a  pagewise-sequential  or  fully 
■quential  recording  strategy  is  necessary  to  avoid 
le  deletenuas  effects  of  coherent-recording  cross 
ilk,  then  the  number  of  exposures  per  training  pair 
scomes  N  and  N^,  respectively,  for  an  N-to-N  inter- 
>nnection  system.  If  M  training  pairs  are  required 
>  fully  train  the  network,  the  total  tiumt^r  of 
cfMSures  for  the  simultaneous,  pagewise-sequential, 
nd  fully  sequential  recording  methods  are  M,  NM, 
nd  N*Af,  respectively.  If  real-time  adaptation  is 
ot  required  and  precomputation  of  wei^ts  is  permit- 
id,  then  the  minimum  numbers  of  exposures  re- 
uired  for  the  three  recording  methods  are  N,  N,  and 
1*.  respectively. 

The  choice  of  recording  method  has  significant 
ractical  consequences  for  photonic  neural-network 
nplementations  (using  the  singie-souroe  interconnec 
ion  architecture)  in  hich  adaptive  computation  of 
he  weights  is  desired.  As  an  example,  if  the  SLM 
rame  time  is  the  temporal  bottleneck  of  the  system 
which  represents  perhaps  a  worst-case  estimate,  in 
hat  the  single-pixel  access  time  may  be  considerably 
horter  than  the  full  SLM  frame  time  for  certain 
ILM’s),  then  the  total  amount  of  time  required  to 
rain  the  network  scales  linearly  with  the  number  of 
equired  exposures.  For  large  numbers  of  intercon- 
lections  (exactly  the  situation  for  which  holographic 
nterconnections  are  presumably  attractive)  this  could 
•esult  in  impractically  long  training  sessions.  For 
acample,  if  N  =  10^,  Af  =  10*,  the  SLM’s  support  a 
LO-ms  update  rate  (such  as  for  nonferroelectric  Uquid- 
:rystal-based  SLM’s),  and  the  holographic-material 
esponse  time  at  the  available  power  level  is  fast 
mough  to  not  provide  an  even  stricter  bound,  then 
he  simultaneous,  pagewise-sequential,  and  fully  se- 
[uential  recording  methods  would  require  10  s,  10*  s 
28  h),  and  10®  s  (32  years),  respectively,  to  record  the 
lesired  interconnections. 

In  the  pagewise-sequential  recording  method,  one 
echnique  for  avoiding  prolonged  training  sessions 
and  consequently  inefficient  use  of  available  optical 
Kjwer)  is  to  focus  the  full  incident  beam  on  a  given 
>ixel  of  the  input  SLM  amd  to  scan  the  beam  pixel-by- 
)Lxel  during  the  recording  of  a  given  traiining  pair, 
f  scanning  all  of  the  input-plane  pixels  can  be 
iccomplished  vdthin  the  frame  time  of  the  SLM 
which  corresponds  to  a  l-|xs  dwell  time  for  the  above 
txample),  then  pagewise-sequential  recording  can  be 
iccomplished  in  the  same  total  time  as  simultaneous 
ecording.  The  concept  can  also  be  applied  to  the 
ully  sequential  recording  method  (which  requires 
OO-ps  dwell  times  for  the  example  above).  Decreases 
n  the  total  training  time  (with  concomitant  increases 
n  the  efficiency  of  power  ed  energy’  use)  for  both 


recording  methods  are  achieved  at  the  cost  of  addi¬ 
tional  system  complexity,  particularly  since  in  many 
cases  provision  must  also  be  made  for  simultaneous 
readout  of  all  input-plane  pixels  during  posttraining 
computation. 

From  the  joint  perspectives  of  recording  schedule 
and  hardware  complexity,  simultaneous  recording 
may  prove  to  be  the  most  desirable  recording  method 
(in  the  adaptive  neural-network  paradigm)  because 
the  full  parallelism  of  the  optical  architecture  is  used, 
thereby  achieving  the  greatest  computational  through¬ 
put  during  training  without  resorting  to  additional 
components  that  increase  the  system  complexity. 
However,  simultaneous  recording  within  the  single¬ 
source  architecture  is  perhaps  the  least  desirable 
recording  method  from  the  perspective  of  interconnec¬ 
tion  fidelity  resulting  from  the  deleterious  effects  of 
coherent-recording  cross  talk. 

E.  Sources  of  Fidelity  Errors  ar.d  ThroughpHJt  Losses 

In  addition  to  coherent-recording  cross  talk,  there  are 
at  least  two  other  sources  of  fidelity  errors  and 
throughput  losses  that  can  be  oresent  in  multiplexed 
fan-out/collinear  fan-in  interconnections  imple¬ 
mented  within  the  single-source  architecture.  The 
first  of  these  is  grating-degeneracy  cross  talk.*^**-“-*< 
This  form  of  cross  talk  arises  because  of  the  particu¬ 
lar  geometric  placement  of  the  recording  pixels  used. 
If  the  pixels  are  placed  on  regular  grids,  gratings  with 
degenerate  wave  vectors  may  be  recorded.  Desired 
interconnections  that  have  degenerate  grating  wave 
vectors  can  have  severely  distorted  weights  during 
reconstruction.  This  form  of  cross  t^  may  be 
alleviated  by  placing  the  pixels  on  fractal  sampling 
grids. The  cost,  however,  is  the  need  to  subsample 
the  input,  training,  and  output  planes,  which  de¬ 
creases  the  interconnection  density  of  the  astern. 

The  second  form  of  cross  talk  is  beam-degeneracy 
cross  talk,  which  arises  from  degeneracies  in  the  wave 
vectors  of  beams  diffracted  from  different  grat- 
ings.®-!*-**-**  This  form  of  cross  talk  is  inherent  in 
fan-out/fan-in  volume  holographic  interconnection 
system.s  in  which  the  diffracted  beams  that  constitute 
a  given  fan-in  exit  collinearly  from  the  holographic 
medium;  it  is  present  regardless  of  the  sampling  grids 
used  for  the  input  and  training  planes.  In  such 
interconnection  systems,  beam-degeneracy  cross  talk 
is  also  present  even  in  the  absence  of  cross  gratings. 
For  readout  with  mutually  coherent  beams  in  the 
single-source  architecture,  our  simulation  results  in¬ 
dicate  that  beam  degeneracy  is  a  significant  source  of 
fidelity  error  only  when  the  gratings  are  overmodu¬ 
lated  Further  simulation  results  indicate  that  beam 
degeneracy  is  the  primary  physical  mechanism  respon¬ 
sible  for  the  incoherent  fan-in  loss  observed  in  single¬ 
source  architectures  when  readout  is  performed  with 
mutually  incoherent  beams.'®  **  ®* 

Kelat^  effects  that  can  be  attributed  to  beam 
degeneracy  have  been  observed  for  2-to-l  beam  com¬ 
bining  using  a  coupled-wave  analysis.***®  In  addi¬ 
tion,  Lee  et  al.  have  estimated  the  magnitude  of 
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^flP^neracy  cn,ss  talk  (identified  therein  as 
cross  talk)  insofar  as  it  affects  reconstruc- 
*^rfideUty  in  a  limiting  case.'^  Similarly,  Slinger 
^nntions  that  beam  degeneracy  (identified  therein  as 
^e  of  several  forms  of  multiple-grating  interactions) 
is  a  potential  source  of  cross  talk.** 


3.  Modeling  Methodology  and  Assumptions 


In  this  section  we  describe  our  modeling  methodology 
and  assumptions  in  detail.  The  motivation  behind 
our  choice  of  modeling  technique  (the  optical  beam- 
propagation  method,  or  BPM)  is  discuss^  in  Section 
3-A,  followed  by  a  brief  outline  of  the  pertinent 
features  of  the  BPM  algorithm  in  Section  3.B.  In 
Section  3.C  we  discuss  the  geometric  dimensions  of 
the  single-source  interconnection  architecture  that 
we  modeled,  the  recording  characteristics  of  the 
holographic  medium,  and  the  method  used  to  obtain 
the  wei^t  matrices  for  the  various  cases. 


A.  Choice  of  Simulation  Method 

The  analysis  of  the  diffraction  properties  of  a  large 
number  of  weighted  gratings  multiplexed  in  a  volume 
holographic  material  has  been  a  difficult  problem. 
Several  techniques  have  been  used  in  the  past  to 
analyze  multiple-grating  diffraction. Of  these 
techniques,  coupled-wave  theory  has  been  the  most 
extensively  used.**-'’®^  ^plication  of  this  method 
often  involves  making  a  number  of  simplifying  as¬ 
sumptions  so  that  analytical  or  numeric^  solutions 
are  more  easily  obtainable.  Such  assumptions  in¬ 
clude  the  use  of  a  2-D  model  (in  which  the  gratings 
and  the  incident  and  diffracted  beams  all  lie  in  a 
plane)  and  TE  polarization  of  the  beams,  With 

these  assumptions  and  the  additional  assumption  of 
no  undesired  cross  gratings,  analytical  solutions  have 
been  obtained  for  the  cases  of  l-to-Af  weighted  fan¬ 
outs,®®  iV-to-1  weighted  fan-ins,®*  and  iV-to-N  weighted 
fan-out/ fan-ins.**  In  Ref.  19  the  assumption  was 
further  made  that  the  weight  matrix  had  a  rank  of 
one,  which  is  achieved  in  practice  in  a  linear  holo¬ 
graphic  medium  by  recording  each  Xj  Nvith  the  same 
training  vector,  8  (i.e.,  only  a  single  training  pair  is 
recorded).  For  weight  matrices  having  a  higher  rank 
(which  is  the  relevant  situation  for  photonic  neural- 
network  implementations)  the  system  of  coupled 
differential  equations  obtained  using  coupled-wave 
analysis  has  not  yielded  an  analytical  solution  and 
thus  must  be  solved  numerically.**  For  example, 
Slinger  has  numerically  modeled  a  5-to-5  interconnec¬ 
tion  system  for  pagewise-seqi’ential  and  fully  sequen¬ 
tial  recording  of  the  desired  interconnection  gratings.  ** 
However,  numerical  solutions  for  the  simultaneous 
recording  case  have  not  been  presented  in  the  litera¬ 
ture  to  our  knowledge. 

Rather  than  obtaining  numerical  solutions  based 
on  the  results  of  a  coupled-wave  analysis,  we  used  the 
optical  beam  propagation  method'-®  to  numerically 
model  readout  of  a  volume  hologram  in  which  multi¬ 
ple  interconnection  gratings  are  stored.  An  advan¬ 
tage  of  this  approach  is  that  fewer  simplifying  as¬ 


sumptions  are  necessary  to  make  the  problem 
computationally  tractable.  In  coupled-wave  analy¬ 
sis,  for  example,  only  Bragg-matched  interactions 
between  the  readout  (and  diffracted)  beams  and  the 
recorded  gratings  are  typically  retained.  In  Ref.  19 
this  led  to  the  assumption  that  all  undesired  cross 
gratings  for  pagewise-sequential  recording  operate  in 
the  Bragg  regime.  However,  in  many  physi^  geom¬ 
etries  (depending  on  the  incidence  angles  of  the 
writing  beams  and  the  thickness  of  the  holographic 
recording  material)  some  or  all  of  these  cross  gratings 
may  actually  operate  in  the  Raman-Nath  diffraction 
regime  or  have  properties  that  are  in  the  transition 
regime  between  Raman-Nath  and  Bragg  diffraction 
(hereinafter  referred  to  as  the  transition  regime). 
For  example,  the  interconnection  geometry  that  we 
simulate  (the  dimensions  of  which  are  discussed  in 
Section  3.C.1)  has  cross  gratings  that  operate  in  all 
three  of  the  possible  diffraction  regimes. 

Coupled-wave  theory  could  be  used  to  model  cross 
gratings  that  operate  in  the  Raman-Nath  or  transi¬ 
tion  diffraction  regimes  if  multiple  diffraction  orders 
for  each  cross  grating  are  retain^  in  the  calculations. 
However,  BPM  has  proved  to  be  a  significantly  faster 
numerical  technique  for  solving  diffraction  grating 
problems  involving  many  spectral  orders  when  the 
same  number  of  orders  are  considered  and  the  same 
level  of  accuracy  is  required.*® 

Restricting  the  analysis  to  only  Bragg-matched 
interactions  using  the  coupled-wave  approach  also 
neglects  a  further  possible  source  of  cross  talk,  namely, 
diffraction  from  non-Bragg-matched  interconnection 
gratings.  By  way  of  contrast,  diffraction  from  non- 
Bragg-matched  gratings  is  incorporated  directly  in 
the  BPM  algorithm.  Although  the  effects  of  such 
non-Bragg-matched  interconnection  gratings  are 
small,  we  show  in  Section  4.E  that  diffraction  from 
such  gratings  is  the  limiting  source  of  fidelity  error  as 
the  strength  of  the  interconnections  in  a  single¬ 
source  architecture  approaches  zero. 

B.  Optical  Beam  Propagation  Method 
The  motivation  for  our  choice  of  computational 
method  is  discussed  above;  herein  we  give  a  brief 
qualitative  overview  of  the  BPM  algorithm.  Refer¬ 
ences  20-23  include  discussions  of  the  derivation 
and/or  the  validity  of  BPM  for  various  diffraction 
problems.  The  BPM  analysis  discussed  herein  con¬ 
siders  two  physical  dimensions  [the  nominal  propaga¬ 
tion  (z  axis)  and  transverse  (x  axis)  dimensions  shown 
in  Fig.  3]  and  assumes  TE  polarization  for  conve¬ 
nience. 

The  optical  beam  propagation  method  simplifies 
the  analysis  of  volume  grating  diffraction  by  replacing 
the  physically  distributed  modulation/diffraction 
problem  with  a  sequence  of  infinitesimally  thin  mod¬ 
ulation  layers  separated  by  homogeneous  regions  in 
which  only  diffraction  occurs.  If  a  sufficient  number 
of  modulation  layers  are  incorporated  in  the  calcula¬ 
tion,  the  system  can  closely  approximate  the  charac¬ 
teristics  of  a  volume  hologram,  The  calculation 
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caused  by  reductions  in  the  grating  modul  ition  depth 
during  recording.'* 

Regardless  of  the  practicality  of  pagewise-sequen- 
tial  recording,  our  modeling  of  a  lO-to-10  single- 
source  architecture  shows  that  if  the  beam  splitter 
ratio  is  made  large  enough  to  reduce  the  magnitudes 
of  the  cross  gratings  relative  to  the  interconnection 
gratings,  then  the  fidelity  and  throughput  of  the 
interconnection  system  can  be  improved  as  compared 
with  the  simultaneous-recording  method.  For  exam¬ 
ple,  Fig.  7  shows  the  diffracted  outputs,  ratios,  and 
ratio  percentage  errors  for  readout  vrith  mutually 
coherent  beams.  A  beam  splitter  ratio  of  1000  is 
assumed,  which  adjusts  the  average  strength  of  the 
interconnection  gratings  to  approximately  three  times 
the  strength  of  the  cross-talk  gratings.  As  seen  in 
Fig.  7(c),  the  percentage  errors  of  the  ratios  show  a 
marked  improvement  over  the  simultaneous-record¬ 
ing  case.  However,  at  the  peak  throughput  of  nearly 
80%  (see  Fig.  6),  the  largest  percentage  errors  in  the 
ratios  are  still  50%.  At  10%  throughput  the  largest 
error  is  ~  15%.  As  shown  in  Section  4.C,  the  fidelity 
can  be  further  improved  when  the  cross  gratings  are 
eliminated  entirely  by  using  fully  sequential  record¬ 
ing. 


Fig.  7.  Same  as  Fig.  5  but  for  the  pagewise-sequentiaJ  recording 
method  with  a  beam  splitter  ratio  R  of  1000. 


Because  increasing  the  beam  splitter  ratio  can 
reduce  the  effects  of  the  cross  gratings  for  the  page- 
wise-sequential  recording  method,  one  mi^t  ask 
whether  increasing  the  beam  splitter  ratio  for  the 
simultaneous-recording  method  in  the  sin^-source 
architecture  offers  any  benefits.  As  can  be  seen  by 
comparing  Eqs.  (24H26)  for  the  grating  strei^hs  of 
the  interconnection  cross  gratings  for  the  simulta¬ 
neous-recording  method,  increasing  the  beam  splitter 
ratio  reduces  the  relative  magnitudes  of  the  training- 
plane  cross  gratings  by  a  factor  of  ^  (Eq.  (26)]  as 
compared  with  the  desired  interconnection  gratings 
[Eq.  (24)j.  However,  the  relative  magnitudes  of  the 
input-plane  cross  gratings  increase  by  a  factor  of  ^ 
(Elq.  (25)].  The  net  result  is  that  varying  the  beam 
splitter  ratio  from  unity  always  enhances  one  set  of 
cross  gratings  relative  to  the  desired  set  of  intercon¬ 
nection  gratings.  Our  simulations  show  that  this 
results  in  significantly  reduced  fidelity  and  throu^- 
put  performance.  Thus  a  unity  beam  splitter  ratio  is 
optimal  for  the  simultaneous  recording  method. 

C.  Fully  Sequential  Recording  and  Readout  with  Mutually 
Coherent  Beams 

Fully  sequential  recording  with  M  training  pairs 
results  in  a  refractive-index  distribution  in  the  holo¬ 
graphic  medium  of 

N  N 

An(r)  =  2  S  cos(K^  •  r),  (34) 

i-i >-i 

in  which  there  are  no  undesired  cross  gratings.  The 
corresponding  grating  strengths  are  given  by  Eq.  (24). 

Simulation  results  for  readout  with  mutually  coher¬ 
ent  beams  are  shown  in  Fig.  8.  The  beam  splitter 
ratio  R  is  assumed  to  be  unity  during  recording  since 
there  are  no  cross  gratings  to  minimize.  Each  dif¬ 
fracted  output  [Fig.  8(a)]  appears  more  sinusoidal  in 
nature  as  a  function  of  grating  strength  than  that  for 
either  the  pagewise-sequential  or  simultaneous  record¬ 
ing  cases  discussed  above.  However,  the  first  peak 
(and  the  following  minimum)  of  each  difiracted  out¬ 
put  occurs  at  a  different  grating  strength,  with  the 
result  that  the  reconstruction  fidelity  is  somewhat 
grating-strength  dependent  [as  indicated  in  Fig.  8(b)]. 
This  result  is  similar  to  the  behavior  noted  in  Fig.  5  of 
Ref.  19  for  a  5-to-5  interconnection,  which  was  mod¬ 
eled  numerically  using  a  coupled-wave  approach. 
The  percentage  error  of  the  ratios,  shown  in  Fig.  8(c), 
shows  a  significant  improvement  over  the  pagewise- 
sequential  and  simultaneous  recording  cases  that  is 
attributable  to  elimination  of  the  cross  gratings. 
At  the  peak  throughput  of  >85%  (see  Fig.  6)  the 
largest  ratio  percentage  error  is  15%.  For  10% 
throughput  the  largest  error  is  only  3%. 

Although  the  fidelity  and  throughput  performance 
are  greatly  improved  when  the  cross  gratings  are 
eliminated,  the  fully  sequential  recording  method  has 
several  serious  difficulties  for  practical  systems,  in¬ 
cluding  large  numbers  of  recording  steps,  inefficient 
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Fij;.  8.  Single-source  architecture  simulation  results  for  the 
pagewise-sequential  recording  method  for  a  beam  splitter  ratio,  R, 
of  1000;  (a)  diffracted  outputs,  (b)  ratio  of  each  dif&acted  output 
to  an  arbitrarily  chosen  output,  and  (c)  percentage  error  of  each 
ratio.  Readout  is  performed  with  mutually  coherent  beams. 


Fig.  9.  Single-source  architecture  simulation  results  for  the  fully 
sequential  recording  method:  (a)  diffiracted  outputs,  (b)  ratio  of 
each  diffracted  output  to  an  arbitnurily  chosen  output,  and  (cl 
percentage  error  of  each  ratio.  Readout  is  performed  with  mutually 
Coherent  beams. 


use  of  the  illuminating  beams,  and  complicated  record¬ 
ing  schedules  (see  Section  2.D). 

D.  Readout  with  Mutually  Incoherent  Beams 

To  this  point  in  the  discussion  we  have  considered 
readout  of  single-source  architectures  with  mutually 
coherent  beams  only.  In  order  to  better  understand 
the  innovations  incorporated  in  the  incoherent/ 
coherent  double  angularly  multiplexed  architecture 
discussed  in  Section  5,  we  briefly  discuss  readout  of 
the  single-source  architecture  with  mutually  incoher¬ 
ent  beams.  For  the  purposes  of  the  section  we 
consider  the  fully  sequent!^  recording  method,  such 
that  only  the  desired  interconnection  gratings  are 
present  in  the  holographic  medium. 

Simulation  results  for  the  diffracted  outputs  in  this 
case  are  shown  in  Fig.  9.  The  reconstruction  fidelity 
is  essentially  the  same  as  for  readout  with  mutually 
coherent  beams  (Fig.  8),  even  though  the  ideal  out¬ 
puts  are  expressed  in  this  case  by  Eq.  (4)  instead  of 
Eq.  (6).  Since  beam  degeneracy  is  always  present  in 
the  single-source  architecture,  the  low  fidelity  error 


results  shown  for  both  mutually  incoherent  and 
ojherent  readout  beams  indicate  that  beam  degener- 
a<y  is  not  a  significant  source  of  fidelity  error  (at  least, 
not  for  grating  strengths  up  to  ~  0.4  rad,  at  which  the 
peak  in  throughput  occurs).  However,  for  readout 
with  mutually  incoherent  beams  the  peak  through¬ 
put  drops  to  10%  (as  shown  in  Fig.  6)  because  of 
incoherent  fan-in  loss,  which  limits  the  maximum 
throughput  to  l/N  for  an  N-to-N  interconnection 
system  with  collinear  output  summation.®* 

An  understanding  of  the  physical  mechanism  re¬ 
sponsible  for  incoherent  fan-in  loss  in  a  single-source 
holographic  interconnection  system  permits  us  to 
design  a  system  that  avoids  this  loss  (as  discussed  in 
Section  5).  The  fan-in  loss  in  a  single-source  archi¬ 
tecture  for  readout  with  incoherent  beams  can  be 
explained  by  the  beam  degeneracy  that  is  present  in 
the  diffracted  outputs.®  ’®  *®-®^  Beam  degeneracy  re¬ 
fers  to  the  k-vector  degeneracies  in  beams  diffracted 
from  different  gratings  in  holographic  interconnec¬ 
tion  systems  that  have  collinear  fan-in.  This  can  be 
understood  by  reference  to  Fig.  10(a),  in  which  a 
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Fig.  10.  (a)  Schematic  diagram  of  the  recording  geometry  of  a 
4-to-4  interconnection  system;  (b)  schematic  diagram  of  the  read¬ 
out  geometry  of  the  4-to-4  intercoimection  system  with  a  single 
beam  (xg)  and  the  resulting  outputs;  (c)  simulation  results  for 
readout  of  the  4-to-4  interconnection  system,  in  which  the  power 
cross  coupled  by  beam  degeneracy  from  the  desired  outputs  to  the 
zi',  zj',  andz^'  beams  can  be  significant. 


4-to-4  interconnection  system  is  represented  schemat¬ 
ically.  As  in  Fig.  4,  each  collimated  beam  is  repre¬ 
sented  by  its  k  vector.  When  readout  is  performed 
using  all  four  input  beams  four  reconstructed 

beams  emerge  collinearly  in  each  of  the  four  p,' 
directions.  Thus  the  four  beams  that  £u-e  diffract^ 


in  a  given  p,‘  direction  have  degenerate  wave  vecton.; 
i.e..  the;y  are  degenerate  beams. 

To  understand  how  this  degeneracy  leads  to  a 
fan-in  loss  for  readout  with  mutually  incoherent 
beams,  consider  readout  of  the  4-to-4  interconnection 
system  described  above  with  a  single  input  beam,  Xj. 
As  shown  by  the  solid  arrows  in  Fig.  10(b),  diffracted 
outputs  are  generated  in  each  of  the  four  output 
directions  (Pi'-P4')  as  well  as  the  zero-order  direction 
X3'.  However,  each  diffracted  beam  p,'  is  in  turn 
diffracted  into  the  directions  X|‘,  xj',  and  X4'  by  the 
gratings  recorded  among  the  training  beams  81-64 
and  the  other  input  beams  X|,  xj,  and  X4.  These 
interactions  cause  power  to  be  coupled  out  of  the 
desired  outputs  p,‘  and  into  the  zero  orders  of  Xi,  Xj, 
and  X4,  generating  the  cross-coupled  beams  Xi',  xj', 
andx4'. 

The  magnitude  of  this  effect  can  be  illustrated  by 
modeling  a  4-to-4  interconnection  system  (using  the 
BPM)  in  which  a  single  beam  of  unit  intensity 
[corresponding  to  xs  in  Fig.  10(b)]  is  used  for  readout. 
Uniform  weights  are  assumed  (W^  =  1  for  all  t  andj), 
and  the  geometry  of  the  system  u^  in  the  modeling 
is  the  same  as  that  described  in  Section  3.C.I.  The 
results  of  the  BPM  calculation  are  shown  in  Fig. 
10(c),  in  which  the  diffraction  efficiencies  of  all  of  the 
diffiracted  beams  are  shown  as  a  function  of  grating 
strength.  The  diffraction  efficiencies  of  the  desired 
outputs  Pi'  are  indicated  with  solid  curves,  vdiile  the 
zero-order  and  cross-coupled  beams  are  represented 
by  the  dotted  and  dashed  curves,  respectiv^y.  The 
peak  diffraction  efficiency  into  the  desired  outputs  pi' 
occurs  at  ^  0.8  rad.  The  summed  power  in  tibe  four 
desired  outputs  is  only  24%,  which  is  1/4  (or  1/N)  of 
the  maximum  available  power.  The  rest  of  the 
power  either  remains  in  the  zero  order  (xs')  or  is 
diffracted  into  the  cross-coupled  bet:  nos  (x/,  xj',  and 
X4'),  each  of  which  has  approximately  the  same  diffrac¬ 
tion  efficient^  as  the  desired  beams  (at  0.8  rad). 

It  is  clear  that  if  the  other  three  input  beams  (xi,  X2, 
and  X4)  are  also  incident  and  that  if  all  of  the  incident 
beams  are  mutually  incoherent,  then  the  summed 
diffraction  efficiency  into  the  desired  outputs  is  still 
only  25%  at  best,  because  the  light  in  each  output 
beam  adds  incoherently.  The  end  result  in  this  case 
is  analogous  to  that  corresp>onding  to  the  coUinear 
combination  of  N  mutually  incoherent  quasi-mono- 
chromatic  beams  of  essentially  identical  wavelengths 
using  iV  —  1  beam  sphtters;  only  1/N  of  the  power  at 
most  can  be  diverted  into  the  desired  direction,  while 
the  remaining  jx)wer  exits  the  system  in  the  direction 
of  the  original  input  beams.*^ 

However,  as  iUustrated  in  the  simulation  results  of 
Section  4.C,  it  is  possible  to  achieve  high  optical 
throughput  in  the  single-source  architecture  if  the 
incident  beams  are  mutually  coherent  and  the  proper 
phase  conditions  are  satisfied.  Alternatively,  if  a 
holographic  interconnection  system  is  design^  such 
that  each  detected  output  consists  of  angularly  in¬ 
stead  of  collinearly  fanned-in  beams,  the  throughput 
loss  that  is  due  to  the  effects  of  beam  degeneracy  can 
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be  avoided  even  for  readout  with  mutually  incoherent 
'  beams.  The  inooherent/coherent  double  angt^ly 
multiplexed  architecture  rfiiifinMMwi  in  Section  5  illus¬ 
trates  one  method  of  achieving  such  angular  fan-in  at 
each  pixel  in  the  output  plane  with  consequent  high 
throughput  for  readout  with  mutually  incoherent 
beams. 

E.  Discussion 

1.  Comparison  of  Recording  Methods  Using  an  rms 
Fidelity  Error 

The  percentage-error  ratio  metric  used  above  to 
assess  the  reconstruction  fidelity  of  the  single-source 
architecture  under  various  recording  conditions  is 
attractive  in  that  it  graphically  illustrates  the  varia¬ 
tion  in  fidelity  among  the  individual  diffracted  outputs. 
However,  a  lumped  error  measure  is  more  convenient 
for  comparing  hot  only  different  recording  methods 
for  the  single-souroe  architectiuc  but  also  for  compar¬ 
ing  the  reconstruction  fidelity  of  different  architec¬ 
tures. 

One  possible  error  measure  is  given  by'* 

inwhichd  =  p/|p|  is  the  unit  vector  in  the  direction  of 
p;  u'  is  defined  similarly.  The  error  measure  e  can 
be  interpreted  as  the  rms  error  of  the  components  of 
the  normalized  diffracted  output  vector  d'.  The 
separate  normalizations  of  the  ideal  and  actual  compo¬ 
nents  by  IpI  and  |p'  |,  respectively,  permit  a  change  in 
the  throu^put  that  does  not  in  turn  bias  the  relative 
fidelity  of  the  components.  The  maximum  value  of 
the  rms  error  e  is  >/2  since  d  and  d’  are  imit  vectors 
having  unipolar  components. 

The  rms  errors  of  the  various  recording  methods 
discussed  in  Sections  4  A— 4.D  for  the  lO-to-10  simula¬ 
tions  of  the  single-source  architecture  are  shown  in 
Fig.  11  as  a  function  of  the  grating  strength  of  the 
largest  interconnection  grating.  For  simultaneous 
recording  the  rms  error  starts  out  relatively  small  at 
essentially  zero  grating  strength  (and  practically  no 
throughput,  as  shown  in  Fig.  6),  and  then  becomes 
large  quite  rapidly  as  the  grating  strength  is  increased. 
If  just  the  cross  gratings  among  the  input-plane  pixels 
are  removed  using  pagewise-sequential  recording  with 
R  =  -  100  (i.e.,  only  the  desired  interconnection 

gratings  and  the  training-plane  cross  gratings  are 
present,  with  comparable  grating  strengths),  neither 
the  fidelity  nor  the  throughput  improves  significantly. 
Decreasing  the  magnitudes  of  the  training-plane  cross 
gratings  relative  to  the  desired  interconnection  grat¬ 
ings  (using  pagewise-sequential  recording  with  R  = 
1000)  results  in  substantial  improvement  in  both  the 
fidelity  and  the  peak  throughput. 

Complete  elimination  of  the  cross  gratings  by  using 
sequential  recording  yields  a  further  significant  im¬ 
provement  in  fidelity  and  a  marginal  increase  in  the 
peak  throughput,  as  shown  in  Fig.  6.  The  reconstruc- 


Fig.  11.  Simulation  resulta  allowing  the  rma  fidelity  error  |aa 
defined  in  Eq.  (35)|  for  aeveral  oombinationa  of  recording  and 
readout  methoda  for  the  ain^e-aouroe  architecture.  An  explana¬ 
tion  of  the  legend  ia  provided  in  Pig.  6. 


tion  fidelity  that  we  obtain  in  the  absence  of  cross 
gratings  is  much  hi^er  than  that  shown  by  Slinger.'* 
This  difference  can  be  understood  as  follows.  Sling- 
er’s  analysis  considers  both  random  complex  weights 
(i.e.,  each  interconnection  grating  has  a  random 
phase  as  well  as  a  random  amplitude)  and  random- 
amplitude  readout  beams  that  have  either  a  0*  or 
180”  relative  phase.  The  direct  implication  is  that 
the  readout  beams  do  not  have  the  same  relative 
phases  as  the  recording  beAms  that  were  used  to 
create  the  weights  [Le.,  the  argiunent  of  the  exponen¬ 
tial  in  Eq.  (7)  is  not  constant  for  all  i  and  j,  su^  that 
condition  (1)  discussed  following  Eq.  (6)  in  Section  2.C 
is  violated].  The  net  result  is  poor  reconstruction 
fidelity. 

In  our  analysis  of  the  single-source  interconnection 
architecture  the  readout  beams  have  the  same  rela¬ 
tive  phases  as  the  recording  beams,  which  permits  the 
appropriate  phasing  conditions  to  be  satisfied  upon 
reconstruction  such  that  good  fidelity  results.  This 
same-relative-phase  condition  is  consistent  with  the 
implementation  of  unipolar  weights  and  outputs,  as 
is  discussed  in  Section  2.C  (recall  that  a  dual-rail 
strategy  can  be  generalized  to  the  bipolar  case  instead 
of  requiring  the  interconnection  system  to  implement 
bipolarity  directly).  A  comparison  of  our  results 
with  Slinger’s  indicates  that  (in  the  absence  of  cross 
gratings)  a  single-source  architecture  can  in  fact  yield 
high  reconstruction  fidelity  for  an  appropriate  map¬ 
ping  of  neural  interconnection  requirements  to  the 
architecture. 

2.  Effects  ofSidelobe  Overlap  on  Reconstruction 
Fidelity 

A  curious  feature  of  the  simulation  results  shown  in 
Fig.  11  is  that  the  fidelity  error  for  each  recording 
method  does  not  go  to  zero  with  decreasing  grating 
strength.  Furthermore,  the  asymptotic  value  of  the 
fidelity  error  near  zero  grating  strength  is  the  same 
for  each  recording  method..  This  indicates  that  the 
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liiiMting  value  of  the  errcr  is  independent  of  the 

■  presence  or  absence  of  cross  gratings. 

The  cause  of  this  somewhat  counterintuitive  behav¬ 
ior  appears  to  be  diffraction  from  non-Bragg-matched 
interconnection  gratings.*"  The  physical  mecha- 

Inism  for  this  effect  is  illustrated  in  Fig.  12,  in  which 
the  +1 -order  diffraction  efficiency  of  two  sinuso'''o| 
phase  gratings  is  shown  as  a  function  of  the  incidence 

■  angle  of  a  plane-wave  readout  beam.  The  ^tings 
are  assum^  to  have  the  same  grating  period,  but 
their  fringes  are  oriented  at  different  slant  angles 
relative  to  the  front  face  of  the  holographic  medium. 

Ilf  a  readout  beam  is  incident  at  6|  (Bragg  matched  to 
grating  G|),  it  is  diffracted  by  both  G|  (through  G|'s 
mainlobe  response)  and  grating  Gj  (through  Gj's 
sidelobe  response  at  6i).  Since  G|  and  G2  have  the 

I  same  grating  period,  the  light  beams  diffracted  from 
both  gratings  are  coUinear  upon  exiting  the  holo¬ 
graphic  medium.  Hence  the  overall  diffraction  re- 

Isponse  at  incidence  angle  6,  is  composed  of  a  small 
contribution  from  G2  as  well  as  the  main  contribution 
from  Gj.  Upon  coherent  addition  of  the  two  contri¬ 
butions  the  net  diffraction  efficiency  in  general  differs 

■  from  what  it  would  be  in  the  absence  of  G2.  If  G] 
implements  a  weighted  interconnection,  this  differ¬ 
ence  results  in  a  small  error  in  the  weight  of  the 
grating.  We  refer  to  the  source  of  this  error  as 

■sidelobe  overlap. 

In  the  single-source  architecture  illustrated  in  Figs. 
3  and  4,  there  are  multiple  sets  of  gratings  that  have 

I  the  stune  grating  period  but  different  slant  ai^es 
because  of  the  existence  of  multiple  pairs  of  writing 
beams  originating  from  the  pixels  of  the  input  and 
training  planes  that  have  equal  angular  separations. 

■For  example,  the  grating  written  by  beams  Xi  and  81 
(Fig.  4)  has  the  same  grating  period  (but  a  different 
slant  ang^e)  as  the  grating  written  by  beams  X2  and  82. 
As  a  result  of  the  presence  of  sidelobe  overlap  among 
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Fig.  12.  Angular  response  characteristics  of  two  Bragg  gratings 
that  have  the  same  grating  period  and  slightly  different  slant 
[angles.  Although  the  main  lobes  of  the  angular  responses  are  well 
separated,  the  sidelobes  and  the  main  lobes  overlap. 
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various  sets  of  gratings,  one  would  expect  the  w<  ights 
of  the  affected  gratings  to  exhibit  a  sr  tail  residual 
error  upon  reconstruction.  This  variation  .in  the 
recorded  weights  should  be  present  even  as  the 
grating  strength  goes  to  zero  b^use  both  the  main 
Bragg  response  of  a  particular  grating  and  the  overlap¬ 
ping  sidelobes  of  the  other  gratings  scale  in  propor¬ 
tion  to  each  other  as  a  function  of  grating  strength. 

One  method  of  testing  the  assertion  that  sidelobe 
overlap  affects  the  reconstruction  fidelity  as  described 
above  is  to  vary  the  thickness  of  the  holographic 
medium  such  that  the  d^ree  of  sidelobe  overlap 
among  the  various  interconnection  gratings  is  changed 
(since  the  angular  width  of  the  diffraction  response  of 
each  grating  is  inversely  proportional  to  the  thick¬ 
ness*^).  If  sidelobe  overlap  is  present,  one  would 
expect  the  limiting  value  of  the  rms  fid^ty  error  to 
generally  increase  with  decreasing  thickness,  and  vice 
versa.  This  behavior  is  exactly  what  we  observe  in 
our  simulations. 

For  the  simultaneous  and  pagewise-sequential  cases 
shown  in  Fig.  11,  the  effect  of  sidelobe  ovet^p  is 
apparent  only  at  small  grating  strengths  because  the 
cross  gratings  are  the  main  source  of  fidelity  error  at 
larger  grating  strengths.  For  the  fully  sequential 
recording  case  with  coherent  readout,  sidelobe  over¬ 
lap  appears  to  be  the  dominant  source  of  reconstruc¬ 
tion  error  up  to  ~0.25  rad.  As  borne  out  other 
simulation  results,  the  reconstruction  fidelity  in  this 
r^ime  can  be  further  improved  simply  by  increasing 
the  thickness  of  the  holographic  m^um. 

3.  Scaling  Trends 

To  examine  how  our  modeling  results  scale  with  the 
number  of  nodes  in  the  interconnection  ^stem,  we 
also  simulated  a  4-to-4  sin|^e-source  interconnection 
architectiure  having  a  4  x  4  weight  matrix  that  is  a 
subset  of  the  10  x  10  weight  matrixused  above.  The 
corresponding  four  input  beams  from  x**""  are  used  to 
read  out  the  interconnection  system.  Simulation 
results  for  the  rms  fidelity  error  of  the  normalized 
output  vectors  and  the  throughput  for  the  4-to-4  case 
are  shown  in  Figs.  13(a)  and  13(b),  respectively,  for 
the  simultaneous,  pagewise-sequential,  and  fully  se¬ 
quential  recording  methods. 

The  overall  performance  characteristics  noticed  in 
the  lO-to-10  simulations  are  also  present  in  the  4-to-4 
results,  in  that  the  rms  fidelity  errors  and  throu^- 
puts  for  the  various  cases  show  no  identifiable  trends 
for  scale-up  from  the  4-to-4  case  to  the  lO-to-10  case. 
As  a  specific  exzunple,  the  simulation  results  for  the 
relative  behaviors  of  the  pagewise-sequential  and 
simultaneous  recording  methods  in  the  4-to-4  case 
are  similar  to  the  lO-to-10  results  (see  Fig.  13).  The 
rms  fidelity  error  for  the  simultaneous  recording 
method  in  both  cases  increases  rapidly  as  the  grating 
strength  increases  from  zero.  When  the  input-plane 
cross  gratings  are  removed  (i.e.,  when  pagewise- 
sequential  recording  is  used  with  jR  =  16  for  the 
4-to-4  case  and  R  =  100  for  the  lO-to-10  case),  neither 
the  fidelity  nor  the  throughput  improves  significantly 
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fig.  13.  Simulation  results  for  a  4-to-4  single-source  ardiitecture. 
Shown  are  (a)  the  rms  fidelity  error  for  various  recording  and 
readout  combinations  and  (b)  the  optical  throughput  for  the 
various  recording  and  readout  combinations.  Clarification  of  the 
legend  [in  (b),  from  top  to  bottom]  is  as  follows:  simultaneous 
recording  method  with  coherent  readout,  pagewise-se<)uential  re¬ 
cording  method  with  a  beam  splitter  ratio  of  16  and  coherent 
readout,  pagewise-sequential  recording  method  with  a  beam  split¬ 
ter  ratio  of  160  and  coherent  readout,  fully  sequential  recording 
with  coherent  readout,  and  fully  sequential  recording  with  incoher¬ 
ent  readout. 


in  either  case.  Increasing  the  strength  of  the  desired 
interconnections  to  approximately  three  times  the 
training-plane  cross  gratings  (i.e.,  when  pagewise- 
sequenti^  recording  is  used  with  R  =  160  for  the 
4-to-4  case)  reduces  the  fidelity  error  and  significantly 
increases  the  peak  throughput,  just  as  in  the  lO-to-10 
case.  In  fact,  the  rms  fidelity  error  is  nearly  the 
same  for  both  simulations  at  a  grating  strength 
corresponding  to  the  peak  throughput  in  each  case. 
At  the  current  level  of  simulation  complexity  it  is  not 
yet  clear  whether  this  apparent  insensitivity  to  the 
dimensions  of  the  interconnection  system  is  general- 
izable. 

In  addition  to  these  similarities,  there  are  a  few 
interesting  differences  among  these  particular  4-to-4 
and  lO-to-10  cases.  For  simultaneous  recording  the 


4-to-4  case  shows  a  significant  drop  in  the  rms  fidelity 
error  at  1.2  rad  (whidi  happens  to  correspond  to 
50%  throughput).  This  result  is  weight-mtrix  and 
input-vector  dependent  and  cannot  be  relied  on  to 
occur  in  gener^.  Similar  comments  apply  to  the 
observed  drop  in  fidelity  error  for  the  /2  «  16  pagewise- 
sequential  recording  case  that  occurs  at  grating 
strengths  in  excess  of  0.85  rad.  Despite  these  differ¬ 
ences,  the  general  trends  do  not  appear  to  be  weight- 
matrix  and  input-vector  dependent. 

Furthermore,  as  the  grating  strength  goes  to  zero 
the  fidelity  error  for  the  4-to-4  case  (Fig.  13(a)]  is  the 
same  for  all  of  the  recording  methods,  as  in  the 
lO-to-10  cases.  However,  the  actual  value  of  the 
error  is  somewhat  smaller  for  the  lO-to-10  results 
than  for  the  4-to-4  simulations.  This  difference  is 
probably  caused  by  the  increased  number  of  overlap¬ 
ping  sidelobes  that  affect  a  particular  weighted  inter¬ 
connection  in  the  lO-to-10  case.  As  the  number  of 
overlapping  sidelobes  increases,  their  effect  on  the 
interconnection  fidelity  may  tend  to  decrease  because 
the  sign  of  each  sidelobe’s  contribution  to  the  weighted 
interconnection  can  be  either  positive  or  negative, 
depending  on  which  particular  sidelobe  is  accessed  for 
the  off-Bragg  grating.  The  contributions  of  a  large 
number  of  sidelobes  may  therefore  tend  to  average  to 
zero.  The  implication  is  that  the  limiting  fidelity 
error  that  is  due  to  sidelobe  overlap  should  decrease  as 
the  number  of  nodes  in  the  interconnection  architec¬ 
ture  increases  (at  least  until  some  other  limiting 
phenomenon  is  reached).  This  should  result  in  bet¬ 
ter  fidelity  performance  for  the  sequential  recording 
method  for  throughputs  at  whii^  the  fidelity  is 
limited  by  sidelobe  overlap. 

F.  Single-Source  Architecture  Simulation  Results: 
Conclusions 

Our  simulation  results  show  that,  as  expected,  the 
coherent-recording  cross-talk  gratings  for  simulta¬ 
neous  recording  in  a  single-source  architecture  cause 
a  significant  degradation  in  reconstruction  fidelity  for 
reasonable  throughputs.  Pagewise-sequential  re¬ 
cording  shows  better  fidelity  and  peak-throughput 
performance  than  simultaneous  recording  if  the  beam 
splitter  ratio  can  be  made  large  enough  to  overcome 
the  undesired  strengthening  of  the  training-plane 
cross  gratings,  which  is  caused  by  the  larger  number 
of  exposures  that  they  receive  relative  to  the  desired 
interconnection  gratings.  Sequential  recordingyields 
both  high  reconstruction  fidelity  and  high  optical 
throughput,  but  at  the  cost  ofN^—  1  more  recording 
steps  than  for  the  case  of  simultaneous  recording. 

Based  on  the  considerations  discussed  in  this  pa¬ 
per,  an  attractive  method  of  implementing  a  single¬ 
source  architecture  is  to  employ  simultaneous  record¬ 
ing  in  a  geometry  that  clearly  separates  the  range  of 
spatial  frequencies  obtained  for  the  desired  intercon¬ 
nection  gratings  from  those  obtained  for  the  cross 
gratings;  and  to  use  a  holographic  material  that  is 
sensitive  to  the  former  range  of  spatial  frequencies 
and  insensitive  to  the  latter. Of  course  this 
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fwumes  that  high  throughput  is  required  in  the 
esultant  interconnection  system.  No  special  re- 
uirements  are  placed  on  the  spatial-frequeni^  sensi¬ 
tivity  of  the  holographic  medium  if  low  throughputs 
■bre  permissible.  In  this  regime  the  simultaneous 
Hrecording  method  yields  the  same  fidelity  perfor¬ 
mance  as  the  other  recording  methods,  all  of  which 
are  limited  by  the  amount  of  angular  sidelobe  overlap 

that  is  present. 

Even  if  a  single-source  architecture  is  implemented 
as  described  above,  it  has  at  least  one  remaining 
characteristic  that  detracts  from  its  potential  use  in 
Kmplementations  of  large-scale  weighted  fan-out 
Bfan-in  interconnection  systems.  As  mentioned  in 
Section  2.E,  the  density  of  pixels  on  the  input  and 
■training  planes  is  limited  to  a  certain  degree  by 
Igrating  degeneracy 

15.  Incoherent/Coherent  Double  Angularly  Multiplexed 
Interconnection  Architecture 

In  this  section  we  discuss  the  operation  of  the 
incoherent/coherent  double  angularly  multiplexed 
■architecture  that  we  have  proposed  and  investigated 
■recently^*^  and  compare  its  relative  merits  to  those  of 
sin^e-source  architectures.  Two  configurations  of 

§the  incoherent/coherent  double  angularly  multi¬ 
plexed  architecture  are  presented;  first,  the  full- 
aperture  configuration^'^  and  second,  the  subholo- 
gram  configuration."  '®’'^*®  Both  configurations 
^permit  simultaneous  recording  of  each  training  pair 
■with  significantly  reduced  coherent-recording  cross 
•talk  as  compared  with  the  single-source  architecture. 
In  addition,  readout  is  performed  with  mutually 

■incoherent  beams  such  that,  during  operation,  each 
difiracted  output  is  describe  by  the  usual  neural- 
network  summation  of  Eq.  (4)  rather  than  the  modi¬ 
fied  summation  of  Eq.  (6).  This  is  accomplished 
■without  sacrificing  the  high  throughput  efficiency 
■that  is  typically  associated  only  with  fully  coherent 
systems,  because  both  configurations  avoid  the  pres- 

Ience  of  beam  degeneracy.  Furthermore,  while  grat¬ 
ing  degeneracy  is  present  in  the  full-aperture  configu¬ 
ration  of  the  architecture,  it  can  potentially  be 
eliminated  by  using  the  subhologram  configuration 

I  and  therefore  does  not  require  subsampling  of  the 
input  and  training  planes. 

The  full-aperture  configuration  of  the  inco¬ 
herent/coherent  double  angularly  multiplexed  archi- 

Itectwire  is  described  in  Section  5  A,  and  suuu'.atioii 
results  are  discussed  in  Section  5.B.  Operation  of 
the  subhologram  configuration  of  the  architecture  is 

■discussed  in  Section  5.C,  and  simulation  results 
follow  in  Section  5.D.  Section  5.E  provides  a  compar¬ 
ison  of  the  two  configurations. 


I 

I 


A.  Full-Aperture  Configuration  of  the  Incoherent/Coherent 
Double  Angularly  Multiplexed  Architec;ture:  Operation 
A  schematic  diagram  for  one  layer  of  the  full-aperture 
configuration  of  the  incoherent/coherent  double  an¬ 
gularly  multiplexed  architecture  is  shown  in  Fig. 
14(a).  The  architecture  has  three  key  components; 


Atnf 


Pig.  14.  Schematic  diagram  of  the  fiiU-aperture  configuration  of 
the  incoherent/ooherent  double  angularly  multiplexed  ardiitec- 
ture  showing  (a)  general  layout,  (b)  recording,  and  (c)  reconstruction. 
M|  and  M]  are  mirrors;  Li-Lj  are  lenses;  BSj  is  a  second  beam 
splitter. 


(1)  a  2-D  array  of  individually  coherent  but  mutually 
incoherent  sources;  (2)  optoelectronic  neuron-unit 
arrays  that  integrate  the  functions  of  light  detection, 
neuron-unit  nonlinear  response,  and  optical  modula¬ 
tion  for  each  pixel  (these  arrays  are  denoted  as  SLMi 
and  SLMs);  and  (3)  the  volume  holographic  intercon¬ 
nection  medium.  In  this  section,  we  briefly  review 
the  interconnection  method  used  in  the  architecture. 
Additional  details  are  provided  in  Refs.  12  and  13. 

The  process  of  recording  a  set  of  interconnections  is 
illustrated  in  Fig.  14(b).  Assume  for  the  moment 
that  a  single  pair  of  training  vectors  [x*"'*,  6'"'')  is  to  be 
recorded.  Light  from  each  source  is  split  into  two 
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paths.  In  ihe  upper  path  each  beam  is  colli¬ 
mated  and  illuminates  the  full  aperture  of  the  train¬ 
ing  plane  (SLM|).  Each  beam  therefore  reads  out 
the  training  vector  that  is  present  on  SLM|  (each 
of  these  beams  is  referred  to  herein  as  a  S'"'  beam). 
The  collimated  beims  deriving  from  the  full  set  of 
sources  propagate  at  different  angles  through  the 
training  plane.  In  the  lower  path  the  source  array  is 
imaged  onto  the  input  plane  (SLMj)  such  that  light 
from  each  source  illuminates  only  one  corresponding 
pixel.  A  collimating  lens  converts  the  beams  emerg¬ 
ing  from  the  pixels  of  SLM2  to  a  set  of  angularly 
distinct  collimated  beams  that  illuminate  the  full 
aperture  of  the  volume  holographic  medium. 

As  a  result  of  the  individually  coherent  but  mutu¬ 
ally  incoherent  nature  of  the  sources  in  the  source 
array,  interconnection  gratings  are  formed  only  be¬ 
tween  each  xj"’  smd  the  corresponding  8*"'  beam  with 
which  it  is  mutually  coherent.  This  permits  the 
simultaneous  recording  of  a  set  of  angularly  multi¬ 
plexed  holograms  in  which  each  hologram  is  formed 
by  the  interference  of  an  angularly  distinct  reference 
beam  x|"'  with  a  second  angularly  distinct  beam  (from 
source  Sj)  bearing  the  image  8<"'.  The  full  set  of 
image-bearing  beams  l(8‘">)y|  that  derives  from  all  of 
the  sources  in  the  source  array  (Sy)  and  that  encodes 
the  omtents  of  SLMi  is  also  angularly  multiplexed. 
Hence  we  describe  this  architecture  as  double  angu¬ 
larly  multiplexed. 

Similar  to  the  case  of  simultaneous  recording  in  a 
single-source  architecture,  the  incoherent/coherent 
double  angularly  multiplexed  architecture  requires 
only  one  exposure  to  record  each  training  pair 
8'"'],  which  is  accomplished  by  turning  on  all  of 
the  sources  in  the  source  array  -simultaneously. 
Since  mutually  incoherent  beams  are  used  to  read  ou 
the  pixels  of  SLM2,  no  coherent-recording  cross-talK 
gratings  among  the  input-plane  pixels  are  formed. 
Similarly,  cross  gratings  among  the  separate  beams 
encoded  with  8'"'  do  not  occur.  The  only  cross 
gratings  that  can  form  in  the  holographic  medium 
result  from  overlaps  among  adjacent  diffracted  compo¬ 
nents  5,  within  each  S'"’  beam  in  the  Fresnel  regime. 
Depending  on  the  size  of  the  pixels  and  the  distance 
between  SLMi  and  the  holographic  medium,  these 
cross  gratings  connect  any  single  pixel  only  to  those  in 
some  local  neighborhood  of  the  pixel.  The  effects  of 
such  local  cross  talk  can  be  minimized  by  adjusting 
the  beam  splitter  ratio  R.  This  interconnection  sys¬ 
tem  therefore  permits  simultaneous  recording  of 
each  training  pair  while  minimizing  the  effects  of 
coherent-recording  cross  talk. 

As  illustrated  in  Fig.  14(c),  readout  is  performed 
using  the  lower  optical  path  (with  all  of  the  sources 
turned  on  simultaneously).  The  volume  holographic 
optical  element,  or  VHOE,  performs  the  requisite  set 
of  weighted  fan-outs,  while  the  imaging  lens  following 
the  VHOE  performs  an  optical  fan-in  operation  by 
imaging  the  diffracted  beams  onto  the  pixels  of  the 
output  plane.  The  angularly  distinct  set  of  colli¬ 
mated  beams  that  illuminated  SLM,  during  recording 


is  therefore  reconstructed  by  the  VHOE:  after  pass¬ 
ing  through  the  lens,  the  be^s  form  a  n  al  image  in 
the  output  plane,  which  is  conjugate  to  the  SLM| 
plane.  The  net  result  is  that  a  fan-in  of  angularly 
distinct  incoherent  beams  is  performed  at  each  node 
in  the  output  plane.  As  long  as  the  angular  spread  is 
sufficiently  large,  an  incoherent  fan-in  can  be  per¬ 
formed  without  incurring  the  usual  fan-in  loss  associ¬ 
ated  with  a  collinear  incoherent  fan-in.  If  we  use 
appropriate  optical  elements  (depending  on  the  partic- 
ulv  neural-network  model  being  implemented),  the 
output  plane  shown  in  Fig.  14(c)  may  be  coincident 
with  the  input  side  of  SLMi  itself  [as  shown  in  Fig. 
14(a)],  with  SLM2,  with  the  input  SLM  of  the  next 
layer,  or  with  any  combination  thereof. 

B.  FuH-Apefture  Configuration  of  the  Incoherent/Coherent 
Ooubfe  Angularly  Multiplexed  Architecture:  Simulation 

The  optical  beam  propagation  method  was  used  to 
analyze  the  fidelity  and  throughput  performance  of 
the  full-aperture  configuration  of  the  incoher¬ 
ent/coherent  double  angularly  multiplexed  architec¬ 
ture  by  simulating  a  lO-to-10  interconnection  system 
in  which  the  same  weight  matrix  was  recorded  as 
above  for  the  single-source  architecture  and  in  which 
the  same  input  vector  was  used  for  readout. 
In  all  such  simulations,  readout  was  performed  using 
mutually  incoherent  beams. 

To  facilitate  direct  comparison  with  the  single- 
source  architecture  simulations,  we  assume  that  the 
same  holographic  medium  characteristics  (linear  ma¬ 
terial  with  a  thickness  of  4.5  mm  and  a  refractive 
index  of  2.52)  and  the  same  operating  wavelength 
f 0.5 14  pm)  are  used.  The  separation  of  the  sources 
in  the  source  array  is  the  same  as  the  pixel  separation 
in  SLM|  and  SLM2,  which  is  257  pm.  Instead  of 
using  two  lenses  to  im^  the  source  array  onto  SLM2 
[lenses  Li  and  L2  in  Fig.  14(a)],  we  use  a  single  lens 
only,  which  is  separated  by  twice  its  focal  length  from 
both  the  source  array  and  SLM2.  The  focal  lengths 
of  lenses  L«  and  L3  are  assumed  to  be  50  mm.  The 
separation  between  SLMi  and  the  holographic  me¬ 
dium  is  also  assumed  to  be  50  mm.  The  beam 
splitter  ratio  R  is  unity. 

Given  the  above  parameters,  light  passing  through 
each  pixel  of  SLM]  spreads  approximately  200  pm 
laterally  in  propagating  to  the  holographic  medium 
because  of  Effraction.  Since  this  is  less  than  one 
pixel  width,  we  assume  for  simplicity  that  geometrical 
optics  adequately  describes  the  propagation  of  light 
from  SLMi  to  the  holographic  medium.  Our  model 
thus  does  not  consider  the  effects  of  any  diffraction- 
induced  local-neighborhood  cross  gratings  that  may 
be  present.  Instead,  each  beam  that  reads  out  SLMj 
[referred  to  as  a  h"'-'  beam  above]  is  assumed  to 
propagate  essentially  unchanged  to  the  holographic 
medium,  forming  an  e.xact  (rather  than  an  approxi¬ 
mate)  image  of  SLM,  on  its  front  face.  The  interfer¬ 
ence  of  each  8'""  beam  with  light  from  its  associated 
xj""  pixel  in  SLM_,  therefore  results  in  ten  (for  a 
lO-to-10  interconnection)  distinct  grating  regions  in 
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th^  hokigr{4>hic  medium.  In  each  distinct  region  the 
resulting  interconnection  grating  pattern  has  i  difler* 
ent  amplitude,  proportional  to  *  for  a  single 

training  pair,  while  the  grating  periods  and  slant 
angles  are  the  same  for  all  ten  regions  that  derive 
from  sources^. 

For  the  parameters  assumed  above  for  the  source 
and  pixel  separations,  for  the  focal  length  of  L4,  and 
for  the  separation  between  SLM|  and  the  holographic 
medium,  each  8'""  beam  is  shifted  by  one  pixel  from 
angularly  adjacent  8""'  beams  on  the  face  of  the 
hologram.  The  net  result  is  19  separate  regions  in 
the  holographic  medium,  with  between  1  and  10 
gratings  multiplexed  in  each  region.  The  2-D  formu¬ 
lation  of  the  BPM  discussed  in  Section  3.3  was  used  to 
simulate  readout  of  each  distinct  region. 

Simulation  results  for  the  diffracted  outputs  of  the 
full-aperture  configuration  of  the  incoherent /coherent 
double  angularly  niultiplexed  architecture  are  shown 
in  Fig.  15  as  a  function  of  the  grating  strength  of  the 
largest  interconnection  grating.  The  rms  fidelity 
error  and  throughput  are  shown  in  Fig.  16;  despite 
readout  by  a  set  of  mutually  incoherent  beams,  the 
peak  throu^put  is  nearly  95%. 

As  in  our  fidelity  analysis  of  the  single-source 
architecture,  the  functional  dependence  of  each  weight 
on  its  corresponding  grating  strength  determines  the 
ideal  input/output  proportionality  factor  that  must 
be  known  for  comparison  with  the  actual  input/output 
characteristics  of  the  system.  For  holographic  inter¬ 


connection  systems  the  ideal  input/output  relation¬ 
ship  is  based  on  the  underlying  physics  of  the  diffrac¬ 
tion  process  used  in  the  system. 

For  example,  direct  application  of  the  weight  rela¬ 
tionship  expressed  in  Rel.  (29)  for  a  single-source 
architecture  to  the  fidelity  analysis  of  the  full- 
aperture  configuration  of  the  incoherent/coherent 
double  angvilarly  multiplexed  architectiu'e  suggests 
the  presence  of  significant  fidelity  errors,  as  illus¬ 
trate  in  Figs.  15(c)  and  16(a).  Although  the  percent¬ 
age  errors  in  Fig.  15(c)  are  small  for  low  throughput, 
they  become  quite  large  (up  to  85%)  at  the  peak 
throughput.  Similar  behavior  is  observed  for  the 
rms  fidelity  error  in  Fig.  16(a)  (solid  curve). 

The  reason  for  the  apparent  lack  of  fidelity  is  that 
Rel.  (29)  does  not  adequately  describe  the  physics 
behind  the  diffraction  process  used  in  the  full- 
aperture  configuration.  In  this  case  each  grating  in 
a  given  region  of  the  holographic  material  is  com¬ 
pletely  independent  of  the  other  interconnection  grat¬ 
ings  in  that  region,  except  for  effects  such  as  angular 
sidelobe  overlap.  We  therefore  assume  that  the  dif¬ 
fraction  efficiency  of  any  particular  grating  is  given  by 
sin^(vy/2),  in  which  Vy  is  the  strength  of  this  grating.*^ 
On  this  basis  the  corresponding  weight  relationship 
for  the  full-aperture  configuration  of  the  architecture 
(for  multiple  training  pairs)  is 

Wi;,  a  sin2iv:«'/2],  (36) 

in  which  is  given  by  Eq.  (24). 

Comparison  of  the  diffracted  outputs  obtained  from 


Fig.  15.  Simulation  results  for  the  lO-to-10  inooherent/coherent 
(louble  angularly  multiplexed  architecture  (full-aperture  configura¬ 
tion)  for  readout  with  mutually  incoherent  beams.  Shown  as 
functions  of  the  grating  strength  of  the  largest  grating  are  (a)  the 
diffracted  outputs,  (b)  the  ratios  of  the  diffracted  outputs,  (c)  the 
percentage  error  using  Rel.  (28)  for  the  dependence  of  each  weight 
on  grating  strength,  and  (d)  the  percentage  error  using  ReL  (36)  for 
the  dependence  of  each  weight  on  grating  strength. 


the  BPM  simulations  to  the  ideal  outputs  calculated 
using  Rel.  (36)  (as  shown  in  Figs.  15(d)  and  16(a)] 
jdelds  much  better  measured  fidelity  performance. 
The  actual  diffracted  outputs  from  the  holographic 
medium  are  of  course  not  changed;  rather,  the  metric 
against  which  they  are  compared  is  related  more 
closely  to  the  underlying  diffraction  behavior  of  the 
interconnection  system.  The  weight  definition  of 
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(b) 

Fig.  16.  Simulation  results  for  the  10>to-10  inooherent/coherent 
double  angularly  multiplexed  architecture  (full-aperture  configura¬ 
tion).  Shown  are  (a)  the  rms  fidelity  error  for  two  different 
functional  dependencies  of  the  weights  on  the  grating  strength  and 
(b)  the  optical  throughput. 


Rel.  (36)  as  compiared  with  that  of  Rel.  (29)  implies  a 
different  functional  form  for  the  weight-update  rela¬ 
tionship  in  an  adaptive  system.  The  effect  of  this 
altered  functional  form  on  the  performance  of  learn¬ 
ing  algorithms  is  currently  unknown.  We  conjec¬ 
ture  that,  given  a  physical  implementation  with 
limited  dynamic  range  for  each  weight,  the  soft- 
limiting  characteristic  provided  by  the  sin*  function 
may  prove  to  be  in  some  respects  preferable  to  a 
hard-clipping  saturation  characteristic. 

As  shown  in  Fig.  15(d),  the  percentage  errors  of  the 
ratios  of  the  diffracted  outputs  obtained  using  the 
sin*  weight  relationship  are  very  close  to  zero  for 
throughputs  up  to  and  including  the  peak  throughput 
of  95%.  This  result  shows  that  the  principal  source 
of  error  in  this  configuration  indeed  derives  from  the 
metric  rather  than  from  some  source  of  cross  talk. 
The  comparison  of  the  two  metrics  is  illustrated 
clearly  in  Fig.  16(a),  in  which  the  rms  fidelity  errors 
are  plotted  on  a  log  scade.  The  rms  fidelity  error  for 
the  sin*  metric  is  relatively  flat  between  0  and  4  rad  of 


grating  strength  (at  which  the  peak  throughput  oc¬ 
curs)  and  does  not  go  to  zero  as  v  -*  0.  The  principal 
reason  for  the  nonzero  fidelity  error  throughout  this 
region  appears  to  be  overlap  of  the  auigular  sidelobes 
of  the  gratings  multiplex^  in  each  region  of  the 
hologram.  Increasing  the  number  of  interconnec¬ 
tion  nodes  in  the  incoherent/coherent  double  angu¬ 
larly  multiplexed  architecture  should  lower  this  level 
of  error  by  averaging  out  the  contributions  of  an 
increased  number  of  sidelobes,  as  discussed  in  Section 
4.E.3. 

For  the  lO-to-10  interconnection  system  analyzed, 
the  rms  fidelity  error  shown  in  Fig.  16(a)  with  a 
weight  relationship  given  by  Rel.  (36)  is  dramatically 
snudler  than  for  the  sequentially  recorded  single¬ 
source  architecture  using  either  mutually  coherent  or 
incoherent  readout  beams.  For  example,  at  the  grat¬ 
ing  strengths  corresponding  to  peak  throughput,  the 
error  for  the  incoherent/coherent  double  angularly 
multiplexed  architecture  is  more  than  an  oi^er  of 
magnitude  lower  than  that  for  the  sequentially  re¬ 
corded  case  of  the  single-source  architecture. 

For  completeness  we  applied  the  weight  relation¬ 
ship  of  Rel.  (36)  to  the  fidelity  analysis  of  the  single¬ 
source  architecture  and  found  that  the  fidelity  errors 
were  essentially  the  same  as  shown  in  Fig.  11.  This 
result  is  not  surprising,  because  in  the  sin^e-source 
architecture  the  grating  strength  of  each  individual 
grating  is  small  (0.4  rad  for  the  largest  interconnec¬ 
tion  grating  at  the  peak  throughput),  and  Rel.  (36) 
reduces  to  Rel.  (29)  for  small 

In  certain  volume  holograpmc  media,  such  as  pho- 
torefractive  crystals,  the  presence  of  many  overlap¬ 
ping  incoherent  beams  in  the  full-aperture  configura¬ 
tion  of  the  incoherent/coherent  double  angularly 
multiplexed  architecture  results  in  a  small  modula¬ 
tion  depth  for  each  pair  of  recording  beams,  which  in 
turn  significantly  reduces  the  achievable  optical 
throughput.**  This  problem  may  be  avoided  by  us¬ 
ing  the  subhologram  configuration  of  the  incoherent- 
coherent  double  angularly  multiplexed  architecture, 
which  is  discussed  in  Section  5.C. 

Furthermore,  as  mentioned  in  the  beginning  of 
Section  5,  the  full-aperture  configuration  of  the  inco¬ 
herent-coherent  double  angularly  multiplexed  archi¬ 
tecture  is  additionally  subject  to  the  effects  of  grating 
degeneracy.  By  modifying  the  architecture  slightly 
to  realize  the  subhologram  configuration,  we  can 
potentially  avoid  this  source  of  cross  talk  without 
using  fractal  sampling  grids.  The  trade-off,  how¬ 
ever,  is  permitting  the  presence  of  additional  cross 
gratings.  As  discussed  below,  these  can  in  turn  be 
minimized  by  adjusting  the  beam  splitter  ratio. 

C.  Subhologram  Configuration  of  the 
Incoherent/Coherent  Double  Angularly  Multiplexed 
Architecture:  Operation 

As  shown  in  Fig.  17,  the  subhologram  configuration 
of  the  double  angularly  multiplexed  architecture  can 
be  created  by  inserting  an  additional  lens  (Lg)  be¬ 
tween  SLMi  and  the  holographic  medium.  The  lens 
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if.  17.  Schematic  diagram  of  the  subhologram  configuration  of 
M  incoherent/ooherent  double  angularly  multiplexed  architec- 
jre. 

i  positioned  at  a  distance  of  one  focal  length  from  the 
medium  such  that  it  performs  a  Fourier  transform  of 
he  beams  emerging  from  SLMi  (this  corresponds  to 
ni Hinging  the  souTce  array  onto  the  medium).  In 
ddition,  lens  L3  is  adjusted  to  image  SLMj  onto  the 
ledium,  which  also  effectively  reimages  the  source 
rray  instead  of  collimating  the  li^t  from  each  pixel 
n  SLM2  as  in  the  full-aperture  configuration  of  the 
rchitecture. 

The  optical  system  is  designed  such  that  the  images 
f  the  source  array  through  the  upper  and  lower 
aths  of  the  architecture  are  in  registry  at  the 
olographic  medium  so  that  spatially  distinct  holo- 
rams  (Le.,  subholograms)  are  formed  across  the  face 
f  the  medium  (as  shown  schematically  in  Fig.  18). 
'o  understand  the  nature  of  each  subhologram,  let  us 
Kus  on  only  one  source  Sj  that  is  imaged  onto  the 
ologram  through  both  paths.  The  image  of  Sj 
hrough  the  upper  path  contains  the  Fourier  trans- 
jrm  of  the  image  of  S*"*’  that  is  on  SLMi,  while  the 
nage  of  Sj  through  the  lower  path  has  an  Intensity 
roportional  to  The  interference  between  the 
wo  beams  creates  weighted  interconnections  be- 
ween  the  yth  pixel  in  the  input  plane  and  all  of  the 
raining-plane  pixels.  However,  interference  among 


g.  18.  Schematic  diagram  of  a  subhologram  array.  Each  sub- 
)logram  is  shown  as  spatially  separate  in  this  case. 


the  components  of  the  Fourier  t,'ansfonn  of  6**' 
causes  cross  gratings  that  form  intraplanar  connec¬ 
tions  among  the  training-plane  pixeb.  By  adiusting 
the  beam  flitter  ratio  R,  we  can  decrease  the  ma^i- 
tude  of  these  cross  gratings  relative  to  the  desired 
interconnection  gratings. 

Each  of  the  subholograms  connects  a  single  pixel  in 
the  input  plane  to  all  of  the  pixels  in  the  timning 
plane  and  thus  performs  a  1-to-N  weighted  fan-out 
upon  reconstruction.  As  in  the  full-aperture  config¬ 
uration  of  the  architecture,  an  imaging  lens  is  used 
after  the  holographic  medium  (shown  as  Lj  in  Fig. 
14(a)  j  to  perform  the  fan-in  to  each  node  in  the  output 
plane.  The  subholograms  in  general  will  at  least 
partially  overlap  within  the  volume  holographic  me¬ 
dium.  depending  on  the  focal  lengths  of  the  lenses  and 
the  spacings  of  the  pixels  and  of  the  sources. 
However,  this  spatial  overlap  does  not  cause  addi¬ 
tional  cross  gratings  to  form  during  simultaneous 
exposure  of  the  set  of  subbolograms  because  the 
sources  are  mutually  incoherent  During  recording, 
all  of  the  sources  are  turned  on  simultaneously  su<^ 
that  the  recording  ofA/ training  pairs  requires  only  Af 
exposures,  just  as  in  the  full-aperture  configuration 
of  the  architecture.  Full  illumination  of  both  SLM 
s^rtures  is  accomplished  with  the  entire  source 
array  on.  which  provides  efficient  ]x>wer  transfer  to 
the  holographic  medium  during  ea<^  exposure. 

In  Section  5.D  we  discuss  simulation  results  and 
scaling  trends  for  the  subhologram  configuration  of 
the  double  angularly  multiplexed  architecture. 

O.  Subhologram  Configuration  of  the 
lrKX>herent/Coherent  Double  Angularly  Multiplexed 
Architecture;  Simulation 

In  order  to  determine  the  relationship  between  an 
individual  weight  and  the  strength  of  its  associated 
grating  within  the  subhologram  configuration  of  the 
incoherent-coherent  double  angularly  multiplexed 
architectiue,  we  use  the  fact  that  each  spatially 
s^regated  subhologram  implements  an  independent 
1-to-iV  fan-out.  As  discussed  in  Section  3.B,  an 
analytical  solution  for  the  difiroction  effidenr^  of  a 
1-to-N  weighted  fan-out  has  been  obtained  using 
coupled-wave  theory  under  the  assumption  that  no 
cross  gratings  are  present.®*  The  net  result  is  that 
the  weights  and  grating  strengths  for  the  subholo¬ 
gram  configuration  of  the  incoherent/coherent  dou¬ 
ble  angularly  multiplexed  architecture  are  related  by 
(in  our  notation) 

w;^  a  [vf  p.  (37) 

which  is  the  same  as  Rel.  (29)  for  the  single-source 
architecture.  The  fidelity  analysis  presented  in  this 
section  is  based  on  the  use  of  Rel.  (37)  to  compute  the 
ideal  input/output  characteristics  of  the  subholo¬ 
gram  configuration. 

For  the  simulations  discussed  in  this  section,  the 
weight  matrix  and  the  readout  vector  used  are  the 
same  as  those  described  in  Section  3.C.3.  Also,  the 
parameters  of  the  optical  components  are  chosen  to 
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be  thb  same  as  for  the  fuU-aperture  oonfiguration  of 
the  architecture,  except  that  the  focal  length  <  f  is 
25  mm,  that  of  is  100  mm,  and  that  of  L«  is  50  mm. 
For  the  purposes  of  our  modeling,  the  resultant 
subholograms  ( 500  ttm  in  diameter,  set  on  500-|i.  n 
centers)  r  re  considered  to  be  fully  separated,  and  the 
pixels  in  the  training  plane  are  treated  as  point 
sources.  The  optical  beam  propagation  method  was 
used  to  model  the  readout  of  each  subhologram. 

Simulation  results  are  shown  in  Figs.  19  and  20. 
In  Fig.  19  the  individual  diffracted  outputs  are  com¬ 
puted  for  a  beam  splitter  ratio  R  of  100.  The 
horizontal  axis,  as  above,  refers  to  the  grating  strength 
of  the  largest  interconnection  grating  in  the  holo¬ 
graphic  m^um.  The  ratio  percentage  errors  in  Fig. 
19(c)  are  not  much  different  than  those  shown  in  Fig. 
8(c)  for  sequential  recording  in  the  single-source 
architecture.  For  example,  at  the  peak  tl^u^put 
of  over  95%  {a&  shown  in  Fig.  20(b)]  the  largest  ratio 
error  for  the  subhologram  configuration  of  the 
incoherent/coherent  double  angularly  multiplexed 


Fig.  19.  Simulation  results  for  the  lO-to-10  incoherent/coherent 
double  angularly  multiple.xed  architecture  (subhologram  configura¬ 
tion)  for  readout  with  mutually  incoherent  beams.  Shown  as 
functions  of  the  grating  strength  of  the  largest  grating  are  (a)  the 
diffracted  outputs,  (b)  the  ratios  of  the  diffracted  outputs,  and 
Ic)  the  percentage  error  of  each  ratio. 


20.  (a)  The  nns  fidelity  error  and  (b)  the  throughput  for 
various  beam  splitter  ratios  in  the  lO-to-10  incoherent-coherent 
double  angularly  multiplexed  architecture  (subhologram  con¬ 
figuration).  When  R  «  100,  the  fidelity  error  and  throughput 
approach  the  case  for  which  there  are  no  cross  gratings.  Readout 
is  performed  with  mutually  incoherent  beams. 


architecture  (with  R  =  100)  is  25%;  at  10%  through¬ 
put,  the  largest  ratio  error  is  only  4%. 

The  rms  fidelity  error  for  the  subhologram  configu¬ 
ration  is  shown  in  Fig.  20(a)  for  several  values  of  the 
beam  splitter  ratio,  2uid  the  corresponding  through¬ 
puts  are  shown  in  Fig.  20(b).  When  the  beam  split¬ 
ter  ratio  is  unity  (i.e.,  when  the  cross  gratings  have 
the  same  relative  amplitudes  as  the  desired  intercon¬ 
nection  gratings),  the  cross  gratings  cause  large  fidel¬ 
ity  errors  except  in  the  limit  of  low  throughput 
(similar  to  the  results  obtained  for  the  single-source 
architecture).  As  the  relative  magnitudes  of  the 
cross  gratings  decrease  with  increasing  beam  splitter 
ratios,  the  fidelity  and  throughput  both  improve. 
For  comparison,  a  case  is  also  shown  for  which  no 
cross  gratings  are  present.  In  all  cases,  the  fidelity 
error  asymptotically  approaches  zero  with  decreasing 
grating  strength  because  angular  sidelobe  overlap 
that  might  affect  the  desired  interconnections  does 
not  occur  in  the  subhologram  configuration. 


10  Marcn  1993  Vol.  32.  No.  8  /  APPLIED  OPTICS 


1 465 


In  ordei?  to  examine  how  thef<!  results  scale  with 
e  I  umber  of  interconnection  nodes  in  the  training 
id  input  planes,  we  simulated  a  4-to-4  interconnec- 
m  ^stem  using  the  same  weight  matrix  and  read- 
it  vector  as  in  the  single-source  architecture  4-U>-4 
nutations.  The  fidelity  error  and  throUe'-iput  re- 
Its  are  shown  in  Fig.  21  for  tht  same  beam  splitter 
lios  as  shown  in  Fig.  20.  For  R  —  100  neither  the 
lelity  nor  the  throughput  seems  to  differ  signifi- 
ntly  between  the  lO-to-10  and  the  4-to-4  results, 
ir  the  case  A  =  1,  however,  substantial  improve- 
snt  in  both  fidelity  and  throughput  is  observed  in 
iling  up  to  the  lO-to-lO  interconnection  system 
>m  the  4-^-4  system  (except  at  small  grating 
-engths). 

Discussion  of  the  Incoherent/Coherent  Double 
guiarty  Multiplexed  Architecture  Configurations 
ir  simulations  demonstrate  the  ability  of  the 
»herent/coherent  double  angularly  multiplexed 
idiitecture  to  obtain  high  optical  throu^put  (at 
kst  for  linear  holographic  materials)  when  mutually 


(b) 

21  (a)  The  rms  fidelity  error  and  (b)  the  throughput  for 

)us  beam  splitter  ratios  in  the  4-to-4  incoherent/coherent 
lie  angularly  multiplexed  architecture  (subhologram  configura- 
Ckimptu-ison  with  Fig.  20  indicates  how  fidelity  and  through- 
'ariations  scale  with  the  number  of  interconnection  nodes. 


incoherent  beams  are  used  during  i-eadout  of  the 
holographic  interconnections.  By  avoiding  beam  de¬ 
generacy  in  both  configurations  of  the  architecture, 
we  can  circumvent  the  usual  incoherent  fan-in  loss 
found  in  the  single-source  architecture. 

Our  simulation  results  (presented  in  Section  S.B) 
further  demonstrate  that  high  reoonstrudUon  fidelity 
is  achievable  in  the  full-aperture  configuration  of  the 
inooherent/coherent  double  angularly  multiplexed 
architecture  for  a  fidelity  metric  based  on  the  diffrac¬ 
tion  properties  of  its  interconnection  system.  As 
noted  ateve,  grating  degeneracy  is  present  in  the 
full-aperture  configuration  such  that  fractal  sam¬ 
pling  grids  may  be  required  in  certain  applications. 
By  contrast,  the  subhologram  configuration  of  the 
architecture  avoids  the  presence  of  grating  d^ener- 
acy,  which  may  permit  an  increased  interconnection 
density  for  a  given  physical  ^stem  volume  relative  to 
both  the  full-aperture  configuration  of  the  incoher¬ 
ent/coherent  double  angularly  multiplexed  architec¬ 
ture  and  the  sin^e-source  architecture.  However, 
the  subhologram  configuration  involves  a  fundamen¬ 
tal  trade-off  between  reconstruction  fidelity  and  the 
beam  splitter  ratio  msulting  from  the  presence  of 
cross  gratings  that  are  not  present  in  the  full- 
aperture  configuration. 

An  important  aspect  of  the  subhologram  configura¬ 
tion  is  the  incorporation  of  both  spatial  and  angular 
multiplexing  in  the  holographic  medium  to  obtain 
independence  of  the  interconnectibn  gratings.  In 
the  limiting  case  of  complete  spatial  separation  of  the 
subholograms,  only  spatial  multiplexing  is  used. 
In  this  case,  a  thin  holographic  material  could  in 
principle  be  used  in  the  interconnection  system. 
However,  since  large  numbers  of  interconnections 
(1()*-10*®)  are  anticipated  for  photonic  neural  net¬ 
works,  space-bandwidth  limitations  will  in  general 
necessitate  some  degree  of  subhologram  overlap  in 
order  for  compact  system  implementations  to  be 
realized.  In  this  case,  the  independence  of  the  inter¬ 
connection  gratings  with  nonnegligible  subhologram 
overlap  necessitates  angular  (or  wavelength)  multi¬ 
plexing  to  achieve  Bragg  isolation,  which  in  tiim 
requires  the  use  of  a  thick  holographic  medium. 
The  set  of  overlapping  subhologram  configurations 
spans  the  continuum  between  the  full-aperture  and 
full-subhologram  configurations  and  as  such  may 
yield  an  optimum  compromise  between  these  two 
extremes.  In  fact,  the  optimal  degree  of  subholo¬ 
gram  overlap  may  well  prove  to  be  material  depen¬ 
dent. 

6.  Comparison  of  Holographic  interconnection 
Techniques 

In  this  section  we  compare  and  contrast  the  full- 
aperture  and  subhologram  configurations  of  the 
incoherent/coherent  double  angularly  multiplexed 
architecture  with  the  single-source  architecture  con¬ 
figured  using  different  recording  methods. 

The  fidelity  and  throughput  performance  of  the 
single-source  architecture  and  of  both  configurations 
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.  of  th^  in  :oherent/coherent  double  angularly  mulU- 
plexed  architecture  are  sununarized  in  Fig-  22.  in 
which  the  rms  fidelity  errors  for  the  lO-to-lO  simula¬ 
tions  are  shown  as  a  function  of  optical  throughput 
instead  of  grating  strength.  For  each  curve  in  the 
figure  the  right-hand  end  point  represents  the  peak 
throughput  achieved  in  the  simulation  for  the  partic¬ 
ular  interconnection  architecture  and  recording 
method  to  which  that  curve  corresponds. 

As  a  result  of  the  effects  of  cross  gratings,  use  of  the 
simultaneous  recording  method  in  the  single-source 
architecture  yields  both  poor  reconstruction  fidelity 
and  a  peak  throughput  of  50%  (for  the  case  simulated). 
For  a  beam  splitter  ratio  of  100  (the  same  as  shown 
for  the  subhologram  configuration  of  the  inco¬ 
herent/coherent  double  angularly  multiplexed  archi¬ 
tecture)  the  pagewise-sequential  recording  method 
does  not  yield  significant  performance  improvement. 
As  mentioned,  ip  Section  4.B.  a  serious  drawback  foi 
the  use  of  this  recording  method  is  that  the  beam 
splitter  ratio  required  for  a  given  level  of  fidelity  error 
increases  quadratically  with  the  number  of  nodes  in 
the  interconnection  system.  When  all  of  the  cross 
gratings  are  eliminated  in  the  sin^e-source  architec¬ 
ture  by  using  a  sequential  recording  technique,  both 
hig^  throughput  and  good  reconstruction  fidelity  are 
achievable. 

As  the  pagewise-sequential  and  fully  sequential 
recording  methods  within  the  single-source  architec¬ 
ture  require  a  significantly  larger  number  of  exposure 
steps  per  training  pair  and  greater  hardware  complex¬ 
ity  than  the  simultaneous  recording  method,  they  are 
potentially  less  attractive  options  for  implementation 
of  large-scale  adaptive  neural-network  systems. 


Fig.  22.  Rms  error  as  a  function  of  the  throughput  for  the 
single-source  interconnection  architecture  (parameterized  by  re¬ 
cording  method)  and  for  the  two  configurations  of  the  inco¬ 
herent/coherent  double  angularly  multiplexed  architecture.  In 
all  cases,  the  single-source  architecture  is  read  out  with  mutually 
coherent  beams  and  the  incoherent/coherent  double  angularly 
multiplexed  architecture  Is  read  out  with  mutually  incoherent 
beams.  The  cun-e  for  the  full-aperture  configuration  of  the 
incoherent/coherent  double  angularly  multiple.xed  architecture 
lies  almo.st  directly  on  the  horizontal  axis. 


Alternatively,  simultaneous  recording  in  a  single¬ 
source  architecture  suffers  from  a  lack  of  reconstruc¬ 
tion  fidelity  for  significant  optical  throughput.  If 
low  throughput  is  tolerable  in  a  given  computational 
architecture,  or  if  the  effects  of  the  cross  gratings  can 
be  minimized  by  using  the  spatial-frequency-sensi¬ 
tive  properties  of  a  particular  holographic  material,  a 
single-source  architecture  with  mutually  coherent 
readout  beams  and  an  appropriate  fractal  sampling 
grid  becomes  a  viable  option. 

As  illustrated  in  Fig.  22,  both  configurations  of  the 
incoherent/coherent  double  angularly  multiplexed 
architecture  can  achieve  both  high  fidelity  and  high 
optical  throughput  while  using  simultaneous  record¬ 
ing  (with  a  large  enough  beam  splitter  ratio  in  the 
case  of  the  subhologram  configuration).  High  opti¬ 
cal  throughput  proves  to  be  obtainable  (in  a  linear 
holographic  medium)  despite  the  use  of  mutually 
incoherent  readout  beams  because  each  configuration 
avoids  the  presence  of  beam  degeneracy. 

In  addition  to  providing  for  linear  summation  of 
the  diffracted  output  intensities,  readout  with  mutu¬ 
ally  incoherent  beams  in  the  incoherent/coherent 
double  angularly  multiplexed  architecture  avoids  (dur¬ 
ing  operation,  not  training)  the  rigid  optical  phase 
stability  requirements  needed  in  a  single-source  archi¬ 
tecture  that  is  read  out  with  mutually  coherent 
beams.  This  feature  reduces  the  degree  of  vibration 
isolation  required  and  hence  increases  the  practical¬ 
ity  of  operating  a  trained  photonic  neural  network  in 
an  industrial  or  field  environment. 

A  further  advantage  of  the  incoherent/coherent 
double  angularly  multiplexed  architecture  is  that  the 
interconnection  gratings  in  the  volume  holographic 
medium  can  be  copied  into  a  second  volume  holo¬ 
graphic  recording  medium  in  a  sin^e  recording  step.®' 
For  example,  the  full  set  of  interconnections  that  are 
learned  in  a  primary  adaptive  system  can  easily  be 
reproduced  in  any  number  of  secondary  permanent 
holographic  media  for  operational  use.  In  contrast, 
direct  single-step  copying  of  an  interconnection  pat¬ 
tern  within  the  single-source  architecture  is  not 
possible  without  sacrificing  either  interconnection 
fidelity  or  optical  throughput.  Instead,  it  appears 
that  at  least  N  (if  not  N^)  exposure  steps  are  required 
for  duplication  of  an  N-to-N  interconnection  system 
within  a  single-source  architecture. 

As  mentioned  in  Section  5.E,  greater  interconnec¬ 
tion  densities  may  be  achievable  if  grating  degeneracy 
(and  hence  the  use  of  fractal  sampling  grids)  can  be 
avoided.  Of  the  interconnection  techniques  dis¬ 
cussed  herein,  only  the  subhologram  configuration  of 
the  incoherent/coherent  double  angularly  multi¬ 
ple.xed  architecture  offers  the  potential  of  avoiding 
the  presence  of  grating  degeneracy. 

In  this  paper  we  have  quantitatively  evaluated  the 
performance  characteristics  of  the  incoherent' 
coherent  double  angularly  multiple.xed  architecture 
(based  on  the  use  of  an  array  of  individually  coherent 
but  mutually  incoherent  sources)  for  highly  multi¬ 
plexed  volume  holographic  interconnection  ap- 
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plications.''  In  addition,  wc  have  quantitatively  eval- 

«uatcd  the  directly  comparable  performance  character¬ 
istics  of  conventional  single-source  architectures. 
'  As  discussed  below,  there  arc  several  clear  directions 
.for  continuing  research. 


7.  Future  Research  Directions 
■|ln  this  study  we  have  performed  a  detailed  compari- 
Ison  of  a  number  of  holographic  interconnection  archi- 
^tectures  that  can  be  used  to  implement  weighted 
interconnections  with  a  high  degree  of  fan-out  and 

Kan-in.  For  the  most  part,  this  comparison  has  been 
nade  on  the  basis  of  interconnection  pathway  indepen¬ 
dence  (lack  of  cross  talk)  and  insertion  loss  (optical 
throughput  efficienQr).  For  the  neural-network  ap- 
Bslication  in  particular,  it  would  be  of  considerable 
Bnterest  to  determine  the  appropriate  levels  of  inter¬ 
channel  isolation  and  insertion  loss  permissible  in  the 

Entext  of  particular  learning  models  without  compro- 
ising  overall  system  performance.  In  other  v  ords, 
what  degree  are  certain  neural-network  models 
sensitive  (or  insensitive)  to  these  effects,  if  we  rely  to  a 
■Greater  or  lesser  extent  on  the  learning  capacity  of  the 
Hietwork  to  obviate  the  necessity  for  ideal  interconnec¬ 
tion  behavior?  Preliminary  experimental  and  theo¬ 
retical  studies  suggest  the  ability  of  some  learning 

Egorithms  to  overcome  a  certain  degree  of  cross  talk 
the  interconnection  ^stem.^-^  but  a  more  compre¬ 
hensive  study  of  this  issue  is  necessa^. 
a  The  simulation  studies  presented  herein  should  be 
Kxpanded  to  evaluate  the  additional  limitations  im- 
^>osed  by  the  effects  of  self-diffraction  among  the 
recording  beams,  grating  erasure,  exposure  schedul- 

Kng,  finite  pixel  size,  and  finite  range  of  grating- 
itrength  modulation  on  both  reconstruction  fidelity 
and  throughput,  particularly  as  the  number  of  inter¬ 
connections  is  increased.  Significantly  increasing 

Bhe  number  of  interconnection  nodes  considered  in 
his  analysis  will  enable  scaling  trends  for  the  relative 
errors  of  each  architecture  to  be  further  identified 
^md  compared.  Inclusion  of  the  grating  recording 
Characteristics  of  photorefractive  media  in  the  holo¬ 
graphic-recording  model  will  permit  the  effects  of 
material  nonlinearities  to  be  determined  and  the 
^tility  of  these  materials  for  the  implementation  of 
Adaptive  photonic  neural  networks  to  be  evaluated, 
t^urthermore,  extension  of  the  BPM  simulations  to 
three  dimensions  will  permit  verification  of  the  trends 

Kbserved  using  a  two-dimensional  model  eind  will  also 
ermit  direct  investigation  of  grating-degeneraej'  ef¬ 
fects. 

tin  addition  to  further  modeling  studies,  previous 
iboratory  work  that  has  confirmed  the  basic  features 
f  the  incoherent/coherent  double  angularly  multi¬ 
plexed  architecture'*  can  be  expanded  to  include  a 
■nore  detailed  study  of  various  implementation  is- 
■ues,  particularly  as  applied  to  photorefractive  media, 
such  issues  include  the  quaintitative  comparison  of 
experimentally  determined  fidelity  errors  and  optical 

throughput  losses  with  simulation  results;  the  effects 
f  scale-up  in  the  number  of  nodes  on  interconnection 


perfonnanoe;  the  effects  of  su  'ihologram  overlap  and 
the  beam  splitter  ratio  on  reoonstructi<  n  fidelity  and 
throughput;  and  continued  device  development  that 
will  permit  the  eventual  integration  of  mutually 
compatible  source  arrays,  neuron-unit  arrays,  and 
volume  holographic  media  into  a  practical  ^stem. 
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